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OS/400 globalization 


As companies integrate e-commerce on a global scale into their fundamental business processes, their 
prospective customers, established customers, and active partners can take advantage of increased 
revenue and decreased expenses through software globalization. They also can improve customer 
communications and increase savings. Globalized software gives you the following advantages: 


Increased customer satisfaction that can increase sales 
Enhanced customer support communications 

Enhanced global information dissemination 

A better return on Information Technology (IT) investments 


This information shows you how to: 


Create an application efficiently and at minimal expense. 

Retrofit existing applications for globalization and create new applications designed for globalization. 
Designing an application for globalization, however, is usually less expensive than retrofitting an existing 
application. 

Ensure that the application design does not interfere with the current or planned design of other 
internationalized applications. 


These pages provide a single source for the information you_need to build applications for national and 
international audiences. You can also find information about in this release and how you can 


Globalization overview 

This topic describes the way that globalization has been implemented on OS/400", including topics 
that describe globalization-specific values on the system and topics that describe how services and 
functions in OS/400 support globalization. 


Set up OS/400 with a national language version 

This topic describes the steps you need to take to properly install and configure a national language 
version on OS/400, with topics ranging from selecting and installing hardware, installing software, 
and configuring your environment to run in a globalized setting. You can use this information as you 
install your own servers, and you can apply the principles when you develop applications for 
customers who are installing their own national language version on OS/400. 


Develop global applications 
This topic provides guidelines for designing, developing, and delivering globalized applications: 


* Designing functions that are sensitive to national languages 
¢ Supporting various types of hardware support 

* Translating the textual data in your application 

¢ Making your application available worldwide 
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This topic describes the ways in which OS/400 enables you to handle data in a globalized 
environment. Included in these pages are topics that describe Unicode and UCS-2 data, the Chinese 
standard GB18030, how to use CCSIDs to integrate multiple language environments consistently, 


and how to use bidirectional data, DBCS data, and locales. 


Globalization category. 
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Globalization checklists 

This topic gathers together all of the checklists that are contained with these pages. These checklists 
are useful reminders of the issues you need to consider as you create and work with global 
applications. 


Handle data in globalized applications 


One of the most critical challenges you will face as you work with globalized servers and applications is 
the effective interaction with data. OS/400 provides a wide range of options that you can use to insure that 
data is viewed and processed seamlessly across national languages. The following topics describe 
globalization as it affects how you handle your data: 


¢ Work with Unicode and UCS-2 data 


Pa 
GB18030: The Chinese standard 


< 
¢ Work with CCSIDs 
¢ Work with bidirectional data 
¢ Work with DBCS data 
¢ Work with locales 


Work with Unicode and UCS-2 data 


Unicode 

Prior to Unicode, the encoding systems that existed did not cover all the necessary numbers, characters, 
and symbols in use. Different encoding systems might assign the same number to different characters. If 
you used the wrong encoding system, your output might not have been what you expected to see. 


Unicode provides a unique number for every character, regardless of platform, language, or program. 
Using Unicode, you can develop a software product that will work with various platforms, languages, and 
countries. Unicode also allows data to be transported through many different systems. 


The following topics describe the Unicode implementation on OS/400: 
- (UTE-a) 
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For more information about Unicode, see the [Unicodd 
web page. 


UTF-8: A Unicode transformation format (UTF) is the algorithmic mapping from every Unicode value to a 
unique byte sequence. UTF-8 converts (via an algorithm) Unicode data so that it: 


* No longer have nulls in it 
* Uses 8 data bits to encode the data 
* Keeps all ASCII codes from 00 to 7F as encoded as themselves 


For example, the string “ABC” in Unicode would be "004100420043"x. However, in UTF-8 it would be 
"414243". 


Since UTF-8 allows Unicode data to flow over an 8-bit network without the network needing to know that it 
is Unicode, UTF-8 is used to store Unicode on several UNIX platforms and is used as the default encoding 
for most new internet standards. 


OS/400 supports UTF-8 encoding with CCSID 1208. 


UTF-16: 
a 


UTF-16 is an encoding of Unicode in which each character is composed of either one or two 16-bit 
elements. 


OS/400 supports UTF-16 encoding with CCSID 1200. 


Unicode was originally designed as a pure 16-bit encoding, aimed at representing all modern scripts. Over 
time, and especially after the addition of over 14,500 composite characters for compatibility with legacy 
sets, it became clear that 16 bits were not sufficient for most users. Out of this arose UTF-16. 


UTF-16 allows access to about 60,000 characters as single Unicode 16-bit units. It can access an 
additional 1,000,000 characters by a mechanism known as surrogate pairs. 


Two ranges of Unicode code values are reserved for the high (first) and low (second) values of these 
pairs. Highs are from OxD800 to OxDBFF, and lows from OxDCO00 to OxDFFF. Since the most common 
characters have already been encoded in the first 64,000 values, the characters requiring surrogate pairs 
are relatively rare. 


For more information about UTF-16, see the 
web page. 


% 


UTF-32: 
a 


UTF-32 is an encoding of Unicode in which each character is composed of 4 bytes. 


OS/400 does not support UTF-32 encoding with a CCSID value. 
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Unicode was originally designed as a pure 16-bit encoding, aimed at representing all modern scripts. Over 
time, and especially after the addition of over 14,500 composite characters for compatibility with legacy 
sets, it became clear that 16 bits were not sufficient for many users. Out of this arose UTF-32. 


UTF-32 allows characters to be encoded as 4 bytes at any code point from 00000000 to 0010FFFF. For 
example, the string ABC in UTF-32 would be encoded as x"000000410000004200000043". 


For more information about UTF-32, see the Unicodd 


od 
web page. 
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How Unicode relates to prior standards such as ASCII and EBCDIC: 
o 


This topic provides a historical perspective on the Unicode standard, and explains how it can reduce the 
complexity of handling character data in global applications. 


Evolving standards based on limited platforms 


The representation of character data in modern computer systems can be fairly complicated, depending on 
the needs of your global application. One of the reasons for this complexity is that the methods for 
handling this data have evolved from early methods that served less complicated environments and 
hardware platforms. 


In fact, many early decisions about how to encode characters on a system were guided by the functional 
requirements of specific devices, such as the early Telex (TTY) terminals and punch card technologies. 
For example, the Delete character (with an ASCII value of x’7F’) was required in order to punch out all of 
the holes in a column of a punch card to signify that the column should be ignored. The storage capacities 
of these early computing systems placed additional limitations on system and application designers. 


The character encoding schemes that have grown out of these early systems were built upon this 
historical foundation: 


¢ The ASCII (American Standard Code for Information Interchange) character set uses 7-bit units, with a 
trivial encoding designed for 7-bit bytes. It is the most important character set in use today, despite its 
limitation to very few characters, because its design is the foundation for most modern character sets. 
ASCII provides only 128 numeric values, and 33 of those are reserved for special functions. 


¢ The EBCDIC (Extended Binary-Coded Decimal Interchange Code) character set and a number of 
associated character sets, designed by IBM for its mainframes, uses 8-bit bytes. It was developed at a 
similar time as ASCII, and shares the same set of base characters and has other similar properties. 
Unlike ASCII, the Latin letters are not combined in two blocks for upper- and lower-case. Instead, the 
letters are arranged so that their hexadecimal values have second digits of 1 through 9 (another punch 
card-friendly design). 


Historical simplicity creates modern complexity 

The physical and functional limitations of the early character sets gave way to rapidly expanding hardware 
and functional capabilities. Character representation on computing systems became less dependent on 
hardware; instead, software designers used the existing encoding schemes to accommodate the needs of 
an increasingly global community of computer users. 


Character sets for many characters 
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The most common encodings (character encoding schemes) use a single byte per character, and they are 
often called single-byte character sets (SBCS). They are all limited to 256 characters. Because of this, 
none of them can even cover all of the accented letters for the Western European languages. 
Consequently, many different such encodings were created over time to fulfill the needs of different user 
communities. The most widely used SBCS encoding today, after ASCII, is ISO-8859-1. It is an 8-bit 
superset of ASCII and provides most of the characters necessary for Western Europe. 


However, East Asian writing systems needed a way to store over 10,000 characters, and so double-byte 
character sets (DBCS) were developed to provide enough space for the thousands of ideographic 
characters in East Asian writing systems. Here, the encoding is still byte-based, but each two bytes 
together represent a single character. 


Even in East Asia, text contains letters from small alphabets like Latin or Katakana. These are represented 
more efficiently with single bytes. Multi-byte character sets (MBCS) provide for this by using a variable 
number of bytes per character, which distinguishes them from the DBCS encodings. MBCSs are often 
compatible with ASCII; that is, the Latin letters are represented in such encodings with the same bytes that 
ASCII uses. Some less often used characters may be encoded using three or even four bytes. 


An important feature of MBCSs is that they have byte value ranges that are dedicated for lead bytes and 
trail bytes. Special ranges for lead bytes, the first bytes in multibyte sequences, make it possible to decide 
how many bytes belong together to encode a single character. Traditional MBCS encodings are designed 
so that it is easy to go forwards through a stream of bytes and read characters. However, it is often 
complicated and very dependent on the properties of the encoding to go backwards in text: going 
backwards, it is often hard to find out which variable number of bytes represents a single character, and 
sometimes it is necessary to go forward from the beginning of the text to do this. 


Examples of commonly used MBCS encodings are Shift-JIS and EUC-JP (for Japanese), with up to two 
and three bytes per character, respectively. 


Stateful encodings 


Some encodings are stateful; they have bytes or byte sequences that switch the meanings of the following 
bytes. Simple encodings, like mixed-byte EBCDIC, use Shift-In and Shift-Out control characters (bytes) to 
switch between two states. Sometimes, the bytes after a Shift-In are interpreted as a certain SBCS 
encoding, and the bytes after a Shift-Out as a certain DBCS encoding. This is very different from an 
MBCS encoding where the bytes for each character indicate the length of the byte sequence. 


The most common stateful encoding is ISO 2022 and its language-specific variations. It uses Escape 
sequences (byte sequences starting with an ASCII Escape character, byte value 27) to switch between 
many different embedded encodings. It can also announce encodings that are to be used with special 
shifting characters in the embedded byte stream. Language-specific variants like ISO-2022-JP limit the set 
of embeddable encodings and specify only a small set of acceptable Escape sequences for them. 


Such encodings are very powerful for data exchange but hard to use in an application. Their flexibility 
allows you to embed many other encodings, but direct use in programs and conversions to and from other 
encodings are complicated. For direct use, a program has to keep track not only of the current position in 
the text, but also of the state—which embeddable encoding is currently active—or must be able to 
determine the state for a position from considerable context. For conversions to other encodings, 
converting software may need to have mappings for many embeddable encodings, and for conversions 
from other encodings, special code must figure out which embeddable encoding to choose for each 
character. 


Unicode: The last character set? 


The Unicode standard specifies a character set and several encodings. As of early 2002, it contains 
almost 94000 characters, which include all the characters of the common character sets that were in use 
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when Unicode was started around 1990, plus many that have been added since. It is an open character 
set, which means that it keeps growing and adding less frequently used characters. 


The standard assigns numbers from 0 to Ox10FFFF, which is more than a million possible numbers for 
characters. About 5% of this space is used. Another 5% is in preparation, about 13% is reserved for 
private use (anyone can place any character in there), and about 2% is reserved and not to be used for 
characters. The remaining 75% is open for future use but not by any means expected to be filled up. In 
other words, there is finally a character set with plenty of space! 


Unicode is in use today, and it is the preferred character set for the Internet, especially for HTML and 
XML. It is slowly being adopted for use in e-mail, too. Its most attractive property is that it covers all the 
characters of the world (with exceptions, which will be added in the future). Unicode makes it possible to 
access and manipulate characters by unique numbers—their Unicode code points—and use older 
encodings only for input and output, if at all. 


Why Unicode? 


Hundreds of encodings have been developed, each for small groups of languages and special purposes. 
As a result, the interpretation of text, input, sorting, display, and storage depends on the knowledge of all 
the different types of character sets and their encodings. Programs are written to either handle one single 
encoding at a time and switch between them, or to convert between external and internal encodings. 


Part of the problem is that there is no single, authoritative source of precise definitions of many of the 
encodings and their names. Transferring of text from one machine to another one often causes some loss 
of information. Also, if a program has the code and the data to perform conversion between a significant 
subset of traditional encodings, then it carries several megabytes of data around. 


Unicode provides a single character set that covers the languages of the world, and a small number of 
machine-friendly encoding forms and schemes to fit the needs of existing applications and protocols. It is 
designed for best interoperability with both ASCII and ISO-8859-1, the most widely used character sets, to 
make it easier for Unicode to be used in applications and protocols. 


Unicode encodings 


For single characters, 32-bit integer variables are most appropriate for the value range of Unicode. For 
strings, however, storing 32 bits for each character takes up too much space, especially considering that 
the highest value, Ox1OFFFF, takes up only 21 bits. 11 bits are always unused in a 32-bit word storing a 
Unicode code point. Therefore, you will find that software generally uses 16-bit or 8-bit units as a 
compromise, with a variable number of code units per Unicode code point. It is a trade-off between ease 
of programming and storage space. 


As a result, there are three common ways to store Unicode strings: 

¢ UTF-32, with 32-bit code units, each storing a single code point 

¢ UTF-16, with one or two 16-bit code units for each code point 

¢ UTF-8, with one to four 8-bit code units (bytes) for each code point 


UTF-8 is used mainly as a direct replacement for older MBCS encodings which all use 8-bit code units, 
but it takes some more code to process it. It is a good encoding if 90% of your data is English, since all 
English letters use only one byte. 


UTF-16 is extremely well designed as the best compromise between handling and space, and all 
commonly used characters can be stored with one code unit per code point. This is the default encoding 
for Unicode. 


% 
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International Components for Unicode: The International Components for Unicode (ICU) is a C library 
that provides a full-featured, industrial strength, Unicode support. The library provides: 


* Calendar support 

¢ Character set conversions 

* Collation (language-sensitive) 

* Date and time formatting 

¢ Locales (140+ supported) 

¢ Message catalogs (resources) 

* Message formatting 

* Normalization 

¢ Number and currency formatting 
* Time zones 

* Transliteration 

¢ Word, line, and sentence breaks 


ICU is a collaborative, open-source development project jointly managed by a group of companies and 
individual volunteers throughout the world, using the Internet and the Web to communicate, plan, and 
develop the software and documentation. 


The ICU project is licensed under the IBM Public License, which has been approved by the Open Source 


Initiative. For more information, see international Components for Unicodd 
(http://oss.software.ibm.com/icu/). 


UCS-2 and its relationship to Unicode 


2 

Since the UCS-2 standard is limited to 65,535 characters, and the data processing industry needs over 
94,000 characters, the UCS-2 standard is in the process of being superseded by the TAP UTE-14 
standard. 


However, because UTF-16 is a superset of the existing UCS-2 standard, you can develop your 
applications using the systems existing UCS-2 support as long as your applications treat the UCS-2 as if it 
were UTF-16. 
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UCS (Universal Multiple-Octet Coded Character Set) 


The ISO 10646 standard is a character code designed to encode text for storage in computer files. The 
design of the ISO 10646 standard is based on today’s prevalent character code, ASCII (and ISO 8859-1, 
an extended version of the ASCII code). But ISO 10646 goes beyond ASCIl’s ability to encode only the 
Latin alphabet. The ISO 10646 encoding provides the capability to encode all of the characters used for 
written languages throughout the world. 


Two UCS encoding schemes 


In order to accommodate the many thousands of characters used in international text, ISO/IEC 10646 

specifies the Universal Multiple-Octet Coded Character Set (UCS). UCS can be implemented through two 

encoding schemes: 

¢ UCS-2: Each character is represented by 16 bits or 2 bytes. (The number 2 in UCS-2 indicates 2 
bytes.) For example, uppercase A is represented by 0041. 
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¢ UCS-4: Each character is represented by 32 bits or 4 bytes. (The number 4 in UCS-4 indicates 4 
bytes.) For example, uppercase A is represented by 0000 0041. 


The major difference between the 2-byte and 4-byte representation is that the 4-byte representation allows 
for the presentation or use of additional characters beyond the capability of UCS-2. That is, you can 
encode more characters in UCS-4 than you can in UCS-2. 


Benefits of UCS over ASCII 


UCS provides codes for more than 65,000 characters, a huge increase over ASCII’s 7-bit code capacity of 
128 characters. To keep character coding simple and efficient, the UCS-2 standard assigns each character 
a unique 16-bit value, and does not use complex modes or escape codes to specify modified characters or 
special cases. This simplicity and efficiency makes it easy for computers and software to handle ISO 
10646-encoded text files. 


UCS-2 allows for the use of "combining characters”. A combining character is a non-spacing character that 
is used together with a non-combining character to form a composite character, or glyph. For example, 
Latin small letter A used with a combining tilde results in 


a 


Within UCS-2 and UCS-4, characters can be presented or used at various levels. The levels and their 
descriptions are: 


¢ Level 1: No use of combining characters is allowed. 
¢ Level 2: Limited use of combining characters is allowed. 
¢ Level 3: No restriction on use of combining characters. 


The following topics provide more detailed information about UCS-2 support on OS/400: 
¢ Why use UCS-2? 

* UCS-2 on OS/400 

¢ UCS-2 level-1 mapping tables 


Why use UCS-2?: OS/400 provides multilingual support. UCS-2 provides the means to store and retrieve 
data in the user’s national language of choice in a single file and therefore provides for one database file 
to support all text needs, regardless of the language of the input device. For example, the same parts file 
could have Greek, Russian, and English descriptions and names in it. 


Mapping of data 


OS/400 uses the EBCDIC encoding scheme. However, not all clients attached to it use an EBCDIC 
encoding scheme to store, retrieve, and process data. For example, some clients may use ASCII, PC 
DATA, or other encoding schemes. Using UCS-2 prevents the loss of data due to incomplete conversion 
between encoding schemes and code pages. Therefore, some clients use UCS-2 as an “exchange 
mechanism” that is safe across all platforms. 


Examples: 


The following examples highlight two users on the same system. One user is English and the other Greek. 
The English user has his display device CCSID set to 37. The Greek user has his display device CCSID 
set to 875. Both users query, update, and replace data in the DATABASE1. DATABASE‘1 is tagged with 
CCSID 37. 
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Example: Display data without UCS-2: Problems with data integrity develop because users are operating 
with CCSIDs that have varied character support. That is, not all characters in CCSID 37 are available in 
CCSID 875 and vice-versa. 


Assume that the following names are to be entered by the English-speaking user (display device supports 
a CCSID of 37): 


o 


A 


alson 
° Gifford 


When these entries are stored, the data integrity remains intact. That is, an 


o 


A 
is stored as an 


° 


A 
. This is because the display device CCSID and the database CCSID are both 37. 


Assume the following names are also input into DATABASE1 by the Greek-speaking user (display device 
CCSID of 875): 


o 


A 


Tt 


ines 


DATABASE1 now consists of the following logical entries: 


o 


A 


alson 
° Gifford 
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On 


H 


a 


The Greek characters that make up the name are stored as those characters only if the same character 
exists within CCSID 37. If the character does not exist, the server converts the characters using a 
predetermined algorithm to a code point from code page 37. The algorithm converts 


Q 


to 


° 


A 


The following list shows the code point used to store the first character of each name in DATABASE1. 
(Using only the first character makes the example easier by eliminating long strings of code points which 
would be shown if we presented the code point for each character in the name.) 


Name CCSID 37 Stored Code Point (Hexadecimal) 


e 


A 
alson 67... 


Gifford 
C7... 


laens 
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53... 
Q 
Pp 
L 
W 
a 
Of bat 


The next step in this example is to show how data can be incorrectly selected due to the character 


conversion when it was stored in the database. 


Assume the Greek user wants to find all names beginning with 


Q 


. The following SQL statement would provide two names: 


Q 


alson 
Select from DATABASE] where name LIKE '%' 


The search yielded an unexpected name ( 


° 


A 


alson). This is because the first character in 


o 


A 


alson is stored with the same code point as the first character in 


Q 
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Example: Display data with UCS-2: In this example, using UCS-2 as the CCSID of DATABASE1, we can 
show how data integrity is maintained both in storing and retrieving data. As in the previous example, one 
user is English using CCSID 37 and the other user is Greek using CCSID 875. 


We’ll use DATABASE1 as in the previous example. However DATABASE1 is now defined with CCSID 
13488. (13488 is a UCS-2 CCSID.) 


° 


A 


alson 
° Gifford 


M 


laees 


H 


a 


The key difference in using UCS-2 as the CCSID of DATABASE1 is that data integrity is maintained for 
each user who inputs data to the database. That is each character, regardless of the CCSID of the 
inputting device, is stored with a unique code point. (Remember that in this example the CCSID of 
DATABASE1 is 13488.) 
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Name CCSID 13488 Stored Code Point (Hexadecimal) 


° 


A 
alson O00C5... 


Gifford 
0047... 


Mn 


O3A9... 


039C ... 


Assume the Greek user wants to find all names beginning with 


Q 


. The following SQL statement would provide one name, 


Q 


Q 


H 


a 


, aS compared to two in the previous example: 
Select from DATABASE1 where Substr(name,1,1) = '' 
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The reason for this is that each character stored in a UCS-2 tagged database has a unique code point. 
This contrasts to the previous exampld that had the first character in 


° 


A 


alson stored with the same code point as the first character in 


Q 


mu 


a 


UCS-2 on OS/400: OS/400 supports UCS-2 and implements its UCS-2 conversion support using level 1 
support. That is, no use of combining characters is mapped. 


a 
The coded character set identifier (CCSID) 13488 on OS/400 represents UCS-2. 


UCS-2 cannot be specified as a value for: 


= 


¢ The system CCSID 
¢ A.user profile CCSID 
¢ Ajob CCSID 


OS/400 provides external support for UCS-2 in the following parts of the system (see note, below): 
* Database files and functions 


* Display file and panel groups 

* [Sort sequenced 

¢ UCS-2 variables in UIM 

¢ ILE high-level languages such as RPG 

* Message handling and message catalogs 


Several other OS/400 functions implement UCS-2 internally so that character data integrity is maintained 
for users across multilingual platforms. 


Note: These topics do not give detailed information on application development as it relates to the 


implementation of UCS-2. Rather, they provide highlights of OS/400 support for UCS-2. Where possible, 
reference to a book that provides detailed information for UCS-2 implementation is given. 


a 
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You should have available and understand the information in the Unicode standard. 


% 


For more information about Unicode, see the [Unicodd 

web page. 

Database files and functions: When you create UCS-2 database applications, you need to consider the 
implications for creating physical files (see page , creating logical files (see page Lh, and performing 
input and output on the database files (see page i 


Creating physical files: 


UCS-2 graphic fields can be created in physical files. This is done by specifying a G data type anda 
UCS-2 CCSID for the CCSID keyword. 


The following example shows the DDS for a physical file containing four fields, and the command for 
creating the file: 


A R FMT1 

A EMPNO 6A 

A NAME 306 CCSID(13488) 

A DESCR1 500G CCSID(13488) VARLEN 
A DESCR2 500A 


CRTPF FILE(UCS-2PF) SRCFILE(CLR/QDDSSRC) 


In the example: 

* The first field, EMPNO, is a character field of length 6. The CCSID of the EMPNO field is the SBCS 
CCSID of the job. The decision was made to use a character field because the EMPNO field contains 
only numerics and UCS-2 support is not needed. 

¢ The NAME and DESCR1 fields are both UCS-2 fields. Both of these fields may need to contain data from 
more than one EBCDIC code page so the decision was made to make these fields UCS-2 graphic. 


¢ The DESCR2 field is the SBCS CCSID of the job. This field is used as illustratioin of mapping to a logical 
field in Creating logical files (see page [15). 


You can specify the default (DFT) keyword for UCS-2 graphic fields. The default value can be specified as 
SBCS, bracketed-DBCS, or bracketed-DBCS-graphic character strings. If you do not specify the DFT 
keyword, the default value for fixed-length UCS-2 fields is the UCS-2 blank (hexadecimal 0020). For 
varying-length UCS-2 fields, the default is the empty string. 


Creating logical files: 


You can use logical files to map UCS-2 data to and from character, DBCS-open, or DBCS-graphic. This 
allows UCS-2 graphic data to be manipulated in a character based form. 


The following example shows the DDS for a logical file containing 4 character fields. The UCS-2 graphic 
data is converted to character data when reading from the logical file, and character data is converted to 
UCS-2 graphic data when writing to the file. 


R FMT1 PFILE(UCS2PF1) 

A EMPNO 

A NAME A CCSID(37) 

A DESCR1 A CCSID(37) 

A DESCR2 G CCSID(13488) 
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Database input/output: 


Whenever reading or writing data from or to a field tagged with a UCS-2 CCSID to the job physical files, 
the data is passed as UCS-2 data without any conversions occurring. Regardless of the job CCSID, data 
is passed as UCS-2 data. When writing data to a logical file, the from CCSID is the job CCSID; however, if 
the job CCSID is 65534, the from CCSID is the CCSID of the field in the logical file. 


The following are some scenarios from the physical and logical files listed above. For the scenarios, the 
job CCSID is 297. 


Scenario 1. When reading the data from the physical file: 
¢ EMPNO is converted from its CCSID to 297. 

¢ NAME is not converted but is left as UCS-2 data. 

* DESCR1 is not converted but is left as UCS-2 data. 

* DESCR2 is converted from its CCSID to 297. 


Scenario 2. When writing the data to the physical file: 
* EMPNO is converted from 297 to its CCSID. 

¢ NAME is not converted but is left as UCS-2 data. 

¢ DESCR1 is not converted but is left as UCS-2 data. 
* DESCR2 is converted from 297 to its CCSID. 


Scenario 3. When reading the data from the logical file: 

¢ EMPNO is converted from its CCSID to 297. 

¢ NAME is converted from UCS-2 data to character data with a CCSID of 297. 

¢ DESCR1 is converted from UCS-2 data to character data with a CCSID of 297. 

¢ DESCR2 is converted from character data to UCS-2 data and not converted to the job CCSID. 


Scenario 4. When writing the data to the logical file: 

¢ EMPNO is converted from 297 to its CCSID. 

¢ NAME is converted from 297 to UCS-2 data. 

¢ DESCR1 is converted from 297 to UCS-2 data. 

¢ DESCR2 is converted from UCS-2to its CCSID in the physical file. 


Scenario 5. If the job was 65535, the conversions for the above fields would be: 
¢ EMPNO is not converted. 

¢ NAME is converted from 37 to UCS-2 data. 

¢ DESCR1 is converted from 37 to UCS-2 data. 

* DESCR2 is converted from UCS-2 to its CCSID in the physical file. 


DB2 UDB for iSeries: Keep the following in mind when using DB2 UDB for iSeries applications: 


¢ Implicit conversion when comparing UCS-2 fields with character/IGC/graphic fields as well as with 
literals and host variables can occur. 


¢ Physical and logical files with UCS-2 fields cannot have their CCSIDs changed with the Change 
Physical File (cHGPE) command. 


¢ A UCS-2 CCSID is not allowed on the CHGPF command. 


* The Copy File (CPYe) and Copy From Query File (CPYERMORYE) commands with FMTOPT(*MAP) 
specified is not allowed when copying from or to a UCS-2 graphic field unless: 


— the corresponding field is a UCS-2 or DBCS-graphic field. 
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— the corresponding field is a character, DBCS-open, DBCS-either, or DBCS-only field with a CCSID 
other than 


¢ The Copy File command supports copying of SBCS character, DBCS-open, DBCS-only, 
DBCS-either, and DBCS-graphic fields to and from UCS-2 graphic fields. There is limited support for 
UCS-2 on the FROMKEY, TOKEY, INCCHAR, and INCREL parameters. 


SQL tables: SQL supports tables that contain UCS-2 graphic columns by specifying a UCS-2 CCSID for 
the GRAPHIC and VARGRAPHIC data types. 


The following SQL example creates the table UCS2_TABLE. UCS2_TABLE contains one character column 
called EMPNO, and two UCS-2 graphic columns. NAME is a fixed-length UCS-2 graphic column and 
DESCRIPTION is a variable-length UCS-2 graphic column. The decision was made to use a character 
field since the EMPNO field only contains numerics and UCS-2 support is not needed. The NAME and 
DESCRIPTION fields are both UCS-2 fields. Both of these fields may contain data from more than one 
EBCDIC code page. 

CREATE TABLE UCS2_TABLE (EMPNO CHAR(6) NOT NULL, 


NAME GRAPHIC(30) CCSID 13488, 
DESCRIPTION VARGRAPHIC(500) CCSID 13488) 


Inserting data 


SBCS character, mixed character, and DBCS graphic data can be inserted into UCS-2 graphic columns 
using the SQL INSERT statement. DB2 UDB for iSeries SQL converts the data to UCS-2 graphic. In SQL 
programs, the DECLARE VARIABLE statement can be used to attach a UCS-2 CCSID to graphic host 
variables. 


The following SQL example converts character data to UCS-2 graphic for the NAME and DESCRIPTION 
columns and inserts the row into the UCS2_TABLE. 


INSERT INTO UCS2_TABLE VALUES('000001', ‘John Doe', 'Engineer') 
Selecting UCS-2 data 
Implicit conversion of UCS-2 graphic data is supported on a FETCH or select INTO and CALL. 


In the following example, the EMPNO column is returned in empno_hv as character data. The NAME 
column is returned in name_hv as UCS-2 graphic data because name_hv is a UCS-2 variable. It is not 
converted to character, mixed character, or DBCS graphic. 


char empno_hv[7]; 
wchar_t name_hv[31]; 
EXEC SQL DECLARE :name_hv VARIABLE CCSID 13488; 


EXEC SQL SELECT EMPNO, NAME 
INTO :empno_hv, :name_hv 
.FROM UCS2_TABLE; 


To return UCS-2 graphic data as EBCDIC data, the prior example could be changed to return the UCS-2 
data as character data, EMPNO and NAME are returned in the job CCSID. 


char empno_hv[7]; 
char name_hv[31]; 


EXEC SQL SELECT EMPNO, NAME 


INTO :empno_hv, :name_hv 
FROM UCS2_TABLE; 
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When doing selection, implicit conversions is done when comparing UCS-2 graphic data and character or 
DBCS graphic data. 


The following example converts the character string "John Doe’ to UCS-2 graphic and then selects the 
rows where the NAME column is ‘John Doe’. 


EXEC SQL DECLARE C1 CURSOR FOR 
SELECT * 

FROM UCS2_TABLE 

WHERE NAME = ‘John Doe'; 


For additional information on using SQL with UCS-2 graphic data, see the SQL Referencel topic in the 
Information Center. 


Query files and tools: Open query file (OPNQRYF) command considerations 


The Open Query File (OPNORY#) command, as shown below, can retrieve or perform selection of UCS-2 
data. Using the MAPFLD parameter, data can be mapped to or from UCS-2. 


OPNQRYF FILE(UCS2_TABLE) 
QRYSLT ('NAME=MAPNAME' ) 
MAPFLD((MAPNAME ‘John Doe! *GRAPHIC *N *N 13488)) 


Interactive query tools considerations 
Query for iSeries, DB2 Query Manager, and the DB2 Query Management function for OS/400 all have 


UCS-2 support. UCS-2 data can be displayed or printed on a report; by implicitly converting to either 
character or mixed art. 


For additional information, see the 


e 
and Query Management Programming 


e 


PDFs. 


Data description specifications (DDS): \n DDS, you use the CCSID file-, record-, or field-level keyword to 
specify that a G-type field supports UCS-2 data rater of DBCS-graphical data. See the CCSID keyword 
description in the DD al and Logica | topic. 


The following are DDS considerations for UCS-2 and OS/400 applications: 


* UCS-2 CCSID 13488 can be specified for graphic and variable graphic fields in physical files. UCS-2 
CCSID 61952 cannot be specified in physical files. 


¢ Logical files can be used to map from UCS-2 fields in the physical file to character (A or O) or DBCS 
graphic in the logical. Logical files can also be used to map character (A or O) or DBCS graphic in the 
physical file to UCS-2 graphic in the logical file. A CCSID can be specified in a DDS logical file. If the 
CCSID parameter is specified, the logical file is created using that CCSID. If a CCSID is not specified, 
the job default CCSID is used if mapping from UCS-2 to character is specified. 


If a logical file is used for I/O, fields are defined as character or DBCS graphic and the underlying 
physical fields are defined as UCS-2. On output the data is mapped directly from the job CCSID to 
UCS-2. Data will not map first from the job CCSID to the logical file CCSID and then from the logical file 
CCSID to the physical file UCS-2 CCSID. This mapping prevents data loss. On input, the UCS-2 data is 
mapped directly to the job CCSID. 
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¢ If a UCS-2 CCSID is specified at the file level and there are character fields defined for the file, the file 
can be created and the job default CCSID is used for the fields that do not have an explicit CCSID 
specified. 

* If the field has a UCS-2 CCSID and a user-specified default value is not specified, then the default is 
UCS-2 blanks (X’0020’) for fixed-length UCS-2 graphic and the empty string for varying-length UCS-2 
graphic. A user-specified default may be specified as either a character or graphic literal. This literal 
value is converted to UCS-2 by database and stored internally in UCS-2. 


Display files and panel groups: UCS-2 data is not supported on display devices that currently support the 
5250 data stream. Therefore, conversions between the UCS-2 data and EBCDIC are necessary during 
input/output operations. On output, the UCS-2 data is converted to the CCSID of the device. On input, the 
data is converted from the device CCSID to the UCS-2 CCSID. 


Since the device CCSID, which is determined from the device configuration, determines what the UCS-2 
data is converted to, the converted data appears differently on different devices. For example, a UCS-2 
character which maps to a SBCS character is displayed as a DBCS replacement character on a 
graphic-DBCS capable device. On a DBCS or SBCS capable device, the character appears as a SBCS 
character. A UCS-2 character which maps to a DBCS character is displayed as a graphic-DBCS character 
on a graphic-DBCS capable device. On a DBCS device, a DBCS character is bracketed (enclosed in a 
shift-out and shift-in). A SBCS replacement character is displayed on a SBCS device. 


It is also suggested that all UCS-2 capable fields are initialized in the output buffer before writing the fields 
to the screen. Unpredictable results may occur if default initialization is allowed to take place. 


For more information about display file and panel group considerations, see the IUCS-2 appendix in the 
DDS Reference: Physical and Logical topic. 


UCS-2 variables in UIM: The following example shows how to define a UCS-2 variable in UIM. 
1 :class name=example basetype='graphic 6 13488' width=10, 


2 

3 :class name=example2 basetype='graphic 10 13488' width=20. 

4 

Line 1 defines a class for variables that will contain 6 UCS-2 characters and is to be displayed in a field 
that is 10 bytes long. 


Line 3 defines a class for variables that will contain 10 UCS-2 characters and is to be displayed in a field 
that is 20 bytes long. 


For more information on UCS-2 and UIM, see the definition of the CLAS tag in the 


e 


PDF. 

UCS-2 level-1 mapping tables: You can convert characters encoded in universal coded character set 2 
level 1 (UCS-2 level-1) from uppercase to lowercase. The shows 
the mapping for this conversion. 


You can also convert UCS-2 level-1 characters from lowercase to uppercase. The [_owercase to uppercase) 
shows the mapping for this conversion. 


Use the Convert Casel API to perform these conversions. 
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ISO 10646 uppercase to lowercase UCS-2 level-1 conversion mapping: 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0041 0061 LATIN CAPITAL LETTER A LATIN SMALL LETTER A 

0042 0062 LATIN CAPITAL LETTER B LATIN SMALL LETTER B 

0043 0063 LATIN CAPITAL LETTER C LATIN SMALL LETTER C 

0044 0064 LATIN CAPITAL LETTER D LATIN SMALL LETTER D 

0045 0065 LATIN CAPITAL LETTER E LATIN SMALL LETTER E 

0046 0066 LATIN CAPITAL LETTER F LATIN SMALL LETTER F 

0047 0067 LATIN CAPITAL LETTER G LATIN SMALL LETTER G 

0048 0068 LATIN CAPITAL LETTER H LATIN SMALL LETTER H 

0049 0069 LATIN CAPITAL LETTER | LATIN SMALL LETTER | 

004A 006A LATIN CAPITAL LETTER J LATIN SMALL LETTER J 

004B 006B LATIN CAPITAL LETTER K LATIN SMALL LETTER K 

004C 006C LATIN CAPITAL LETTER L LATIN SMALL LETTER L 

004D 006D LATIN CAPITAL LETTER M LATIN SMALL LETTER M 

004E OO6E LATIN CAPITAL LETTER N LATIN SMALL LETTER N 

004F OO6F LATIN CAPITAL LETTER O LATIN SMALL LETTER O 

0050 0070 LATIN CAPITAL LETTER P LATIN SMALL LETTER P 

0051 0071 LATIN CAPITAL LETTER Q LATIN SMALL LETTER Q 

0052 0072 LATIN CAPITAL LETTER R LATIN SMALL LETTER R 

0053 0073 LATIN CAPITAL LETTER S LATIN SMALL LETTER S 

0054 0074 LATIN CAPITAL LETTER T LATIN SMALL LETTER T 

0055 0075 LATIN CAPITAL LETTER U LATIN SMALL LETTER U 

0056 0076 LATIN CAPITAL LETTER V LATIN SMALL LETTER V 

0057 0077 LATIN CAPITAL LETTER W LATIN SMALL LETTER W 

0058 0078 LATIN CAPITAL LETTER X LATIN SMALL LETTER X 

0059 0079 LATIN CAPITAL LETTER Y LATIN SMALL LETTER Y 

005A 007A LATIN CAPITAL LETTER Z LATIN SMALL LETTER Z 

00CO OOEO LATIN CAPITAL LETTER A LATIN SMALL LETTER A GRAVE 
GRAVE 

00C1 00E1 LATIN CAPITAL LETTER A LATIN SMALL LETTER A GRAVE 
ACUTE 

00C2 00E2 LATIN CAPITAL LETTER A LATIN SMALL LETTER A GRAVE 
CIRCUMFLEX 

00C3 00E3 LATIN CAPITAL LETTER A LATIN SMALL LETTER A GRAVE 
TILDE 

00C4 00E4 LATIN CAPITAL LETTER A LATIN SMALL LETTER A GRAVE 
DIAERESIS 

00C5 OOE5 LATIN CAPITAL LETTER A RING | LATIN SMALL LETTER A GRAVE 

00C6 OOE6 LATIN CAPITAL LETTER AE LATIN SMALL LETTER A GRAVE 

00C7 00E7 LATIN CAPITAL LETTER C LATIN SMALL LETTER A GRAVE 


CEDILLA 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


00C8 OO0E8 LATIN CAPITAL LETTER E LATIN SMALL LETTER A GRAVE 
GRAVE 

00C9 OOE9 LATIN CAPITAL LETTER E LATIN SMALL LETTER A GRAVE 
ACUTE 

O00CA OOEA LATIN CAPITAL LETTER E LATIN SMALL LETTER E 
CIRCUMFLEX CIRCUMFLEX 

00CB OOEB LATIN CAPITAL LETTER E LATIN SMALL LETTER E 
DIAERESIS DIAERESIS 

00CC OOEC LATIN CAPITAL LETTER | LATIN SMALL LETTER | GRAVE 
GRAVE 

00CD OOED LATIN CAPITAL LETTER | LATIN SMALL LETTER | ACUTE 
ACUTE 

OOCE OOEE LATIN CAPITAL LETTER | LATIN SMALL LETTER | 
CIRCUMFLEX CIRCUMFLEX 

OOCF OOEF LATIN CAPITAL LETTER | LATIN SMALL LETTER | 
DIAERESIS DIAERESIS 

00DO OOFO LATIN CAPITAL LETTER ETH LATIN SMALL LETTER ETH 

ooD1 OOF 1 LATIN CAPITAL LETTER N LATIN SMALL LETTER N TILDE 
TILDE 

00D2 OOF2 LATIN CAPITAL LETTER O LATIN SMALL LETTER O GRAVE 
GRAVE 

00D3 OOF3 LATIN CAPITAL LETTER O LATIN SMALL LETTER O ACUTE 
ACUTE 

00D4 OOF4 LATIN CAPITAL LETTER O LATIN SMALL LETTER O 
CIRCUMFLEX CIRCUMFLEX 

00D5 OOF5 LATIN CAPITAL LETTER O LATIN SMALL LETTER O TILDE 
TILDE 

00D6 OOF6 LATIN CAPITAL LETTER O LATIN SMALL LETTER O 
DIAERESIS DIAERESIS 

00D8 OOF8 LATIN CAPITAL LETTER O LATIN SMALL LETTER O SLASH 
SLASH 

00D9 OOF9 LATIN CAPITAL LETTER U LATIN SMALL LETTER U GRAVE 
GRAVE 

OODA OOFA LATIN CAPITAL LETTER U LATIN SMALL LETTER U ACUTE 
ACUTE 

0ODB OOFB LATIN CAPITAL LETTER U LATIN SMALL LETTER U 
CIRCUMFLEX CIRCUMFLEX 

00DC OOFC LATIN CAPITAL LETTER U LATIN SMALL LETTER U 
DIAERESIS DIAERESIS 

0OoDD OOFD LATIN CAPITAL LETTER Y LATIN SMALL LETTER Y ACUTE 
ACUTE 

OODE OOFE LATIN CAPITAL LETTER THORN | LATIN SMALL LETTER THORN 

0100 0101 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
MACRON MACRON 

0102 0103 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 


BREVE 


BREVE 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0104 0105 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
OGONEK OGONEK 

0106 0107 LATIN CAPITAL LETTER C LATIN SMALL LETTER C WITH 
WITH ACUTE ACUTE 

0108 0109 LATIN CAPITAL LETTER C LATIN SMALL LETTER C WITH 
WITH CIRCUMFLEX CIRCUMFLEX 

010A 010B LATIN CAPITAL LETTER C LATIN SMALL LETTER C WITH 
WITH DOT ABOVE DOT ABOVE 

010C 010D LATIN CAPITAL LETTER C LATIN SMALL LETTER C WITH 
WITH CARON CARON 

010E 010F LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH CARON CARON 

0110 0111 LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH STROKE STROKE 

0112 0113 LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
MACRON MACRON 

0114 0115 LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
BREVE BREVE 

0116 0117 LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
DOT ABOVE DOT ABOVE 

0118 0119 LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
OGONEK OGONEK 

011A 011B LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
CARON CARON 

011C 011D LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH CIRCUMFLEX CIRCUMFLEX 

O11E O11F LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH BREVE BREVE 

0120 0121 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH DOT ABOVE DOT ABOVE 

0122 0123 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH CEDILLA CEDILLA 

0124 0125 LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH CIRCUMFLEX CIRCUMFLEX 

0126 0127 LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH STROKE STROKE 

0128 0129 LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
TILDE TILDE 

012A 012B LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
MACRON MACRON 

012C 012D LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
BREVE BREVE 

012E 012F LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
OGONEK OGONEK 

0130 0069 LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | 
DOT ABOVE 

0132 0133 LATIN CAPITAL LIGATURE lJ LATIN SMALL LIGATURE IJ 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0134 0135 LATIN CAPITAL LETTER J WITH | LATIN SMALL LETTER J WITH 
CIRCUMFLEX CIRCUMFLEX 

0136 0137 LATIN CAPITAL LETTER K WITH | LATIN SMALL LETTER K WITH 
CEDILLA CEDILLA 

0139 013A LATIN CAPITAL LETTER L WITH | LATIN SMALL LETTER L WITH 
ACUTE ACUTE 

013B 013C LATIN CAPITAL LETTER L WITH | LATIN SMALL LETTER L WITH 
CEDILLA CEDILLA 

013D 013E LATIN CAPITAL LETTER L WITH | LATIN SMALL LETTER L WITH 
CARON CARON 

013F 0140 LATIN CAPITAL LETTER L WITH | LATIN SMALL LETTER L WITH 
MIDDLE DOT MIDDLE DOT 

0141 0142 LATIN CAPITAL LETTER L WITH | LATIN SMALL LETTER L WITH 
STROKE STROKE 

0143 0144 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH ACUTE ACUTE 

0145 0146 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH CEDILLA CEDILLA 

0147 0148 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH CARON CARON 

014A 014B LATIN CAPITAL LETTER ENG — |LATIN SMALL LETTER ENG (SAMI) 
(SAMI) 

014C 014D LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH MACRON MACRON 

014E 014F LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH BREVE BREVE 

0150 0151 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH DOUBLE ACUTE DOUBLE ACUTE 

0152 0153 LATIN CAPITAL LIGATURE OE | LATIN SMALL LIGATURE OE 

0154 0155 LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH ACUTE ACUTE 

0156 0157 LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH CEDILLA CEDILLA 

0158 0159 LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH CARON CARON 

015A 015B LATIN CAPITAL LETTER S WITH| LATIN SMALL LETTER S WITH 
ACUTE ACUTE 

015C 015D LATIN CAPITAL LETTER S WITH| LATIN SMALL LETTER S WITH 
CIRCUMFLEX CIRCUMFLEX 

015E O15F LATIN CAPITAL LETTER S WITH| LATIN SMALL LETTER S WITH 
CEDILLA CEDILLA 

0160 0161 LATIN CAPITAL LETTER S WITH| LATIN SMALL LETTER S WITH 
CARON CARON 

0162 0163 LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 
CEDILLA CEDILLA 

0164 0165 LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 


CARON 


CARON 
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Uppercase code Lowercase code Uppercase character 


point point description Lowercase character description 

0166 0167 LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 
STROKE STROKE 

0168 0169 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH TILDE TILDE 

016A 016B LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH MACRON MACRON 

016C 016D LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH BREVE BREVE 

016E 016F LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH RING ABOVE RING ABOVE 

0170 0171 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DOUBLE ACUTE DOUBLE ACUTE 

0172 0173 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH OGONEK OGONEK 

0174 0175 LATIN CAPITAL LETTER W LATIN SMALL LETTER W WITH 
WITH CIRCUMFLEX CIRCUMFLEX 

0176 0177 LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
CIRCUMFLEX CIRCUMFLEX 

0178 OOFF LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y 
DIAERESIS DIAERESIS 

0179 017A LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
ACUTE ACUTE 

017B 017C LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
DOT ABOVE DOT ABOVE 

017D 017E LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
CARON CARON 

0181 0253 LATIN CAPITAL LETTER B WITH | LATIN SMALL LETTER B WITH 
HOOK HOOK 

0182 0183 LATIN CAPITAL LETTER B WITH | LATIN SMALL LETTER B WITH 
TOPBAR TOPBAR 

0184 0185 LATIN CAPITAL LETTER TONE |LATIN SMALL LETTER TONE SIX 
SIX 

0186 0254 LATIN CAPITAL LETTER OPEN _|LATIN SMALL LETTER OPEN O 
O 

0187 0188 LATIN CAPITAL LETTER C LATIN SMALL LETTER C WITH 
WITH HOOK HOOK 

018A 0257 LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH HOOK HOOK 

018B 018C LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH TOPBAR TOPBAR 

018E 0258 LATIN CAPITAL LETTER LATIN SMALL LETTER REVERSED 
REVERSED E E 

018F 0259 LATIN CAPITAL LETTER LATIN SMALL LETTER SCHWA 
SCHWA 

0190 025B LATIN CAPITAL LETTER OPEN _|LATIN SMALL LETTER OPEN E 
E 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0191 0192 LATIN CAPITAL LETTER F WITH | LATIN SMALL LETTER F WITH 
HOOK HOOK 

0193 0260 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH HOOK HOOK 

0194 0263 LATIN CAPITAL LETTER LATIN SMALL LETTER GAMMA 
GAMMA 

0196 0269 LATIN CAPITAL LETTER IOTA LATIN SMALL LETTER IOTA 

0197 0268 LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
STROKE STROKE 

0198 0199 LATIN CAPITAL LETTER K WITH | LATIN SMALL LETTER K WITH 
HOOK HOOK 

019C 026f LATIN CAPITAL LETTER LATIN SMALL LETTER TURNED M 
TURNED M 

019D 0272 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH LEFT HOOK LEFT HOOK 

019F 0275 LATIN CAPITAL LETTER O LATIN SMALL LETTER BARRED O 
WITH MIDDLE TILDE 

01A0 O1A1 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HORN HORN 

01A2 01A3 LATIN CAPITAL LETTER Ol LATIN SMALL LETTER Ol 

01A4 01A5 LATIN CAPITAL LETTER P WITH | LATIN SMALL LETTER P WITH 
HOOK HOOK 

01A7 01A8 LATIN CAPITAL LETTER TONE |LATIN SMALL LETTER TONE TWO 
TWO 

01A9 0283 LATIN CAPITAL LETTER ESH LATIN SMALL LETTER ESH 

01AC 01AD LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 
HOOK HOOK 

O1AE 0288 LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 
RETROFLEX HOOK RETROFLEX HOOK 

O1AF 01BO LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HORN HORN 

01B1 028A LATIN CAPITAL LETTER LATIN SMALL LETTER UPSILON 
UPSILON 

01B2 028B LATIN CAPITAL LETTER V WITH | LATIN SMALL LETTER V WITH 
HOOK HOOK 

01B3 01B4 LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
HOOK HOOK 

01B5 01B6 LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
STROKE STROKE 

01B7 0292 LATIN CAPITAL LETTER EZH LATIN SMALL LETTER EZH 

01B8 01B9 LATIN CAPITAL LETTER EZH LATIN SMALL LETTER EZH 
REVERSED REVERSED 

01BC 01BD LATIN CAPITAL LETTER TONE |LATIN SMALL LETTER TONE FIVE 
FIVE 

0104 01C6 LATIN CAPITAL LETTER DZ LATIN SMALL LETTER DZ WITH 


WITH CARON 


CARON 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


01C5 01C6 LATIN CAPITAL LETTER D LATIN SMALL LETTER DZ WITH 
WITH SMALL LETTER Z WITH =| CARON 
CARON 

01C7 01C9 LATIN CAPITAL LETTER LJ LATIN SMALL LETTER LJ 

01C8 01C9 LATIN CAPITAL LETTER L WITH |LATIN SMALL LETTER LJ 
SMALL LETTER J 

01CA 01CC LATIN CAPITAL LETTER NJ LATIN SMALL LETTER NJ 

01CB 01CC LATIN CAPITAL LETTER N LATIN SMALL LETTER NJ 
WITH SMALL LETTER J 

01CD 01CE LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
CARON CARON 

01CF 01D0 LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
CARON CARON 

01D1 01D2 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH CARON CARON 

01D3 01D4 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH CARON CARON 

01D5 01D6 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DIAERESIS AND DIAERESIS AND MACRON 
MACRON 

01D7 01D8 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DIAERESIS AND ACUTE | DIAERESIS AND ACUTE 

01D9 01DA LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DIAERESIS AND CARON § | DIAERESIS AND CARON 

01DB 01DC LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DIAERESIS AND GRAVE_ | DIAERESIS AND GRAVE 

01DE 01DF LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
DIAERESIS AND MACRON DIAERESIS AND MACRON 

01E0 01E1 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH DOT 
DOT ABOVE AND MACRON ABOVE AND MACRON 

01E2 01E3 LATIN CAPITAL LIGATURE AE LATIN SMALL LIGATURE AE WITH 
WITH MACRON MACRON 

01E4 O1E5 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH STROKE STROKE 

01E6 01E7 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH CARON CARON 

01E8 01E9 LATIN CAPITAL LETTER K WITH | LATIN SMALL LETTER K WITH 
CARON CARON 

O1EA 01EB LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH OGONEK OGONEK 

01EC 01ED LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH OGONEK AND MACRON_ | OGONEK AND MACRON 

O1EE O1EF LATIN CAPITAL LETTER EZH LATIN SMALL LETTER EZH WITH 
WITH CARON CARON 

O1F1 01F3 LATIN CAPITAL LETTER DZ LATIN SMALL LETTER DZ 

01F4 01F5 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 


WITH ACUTE 
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ACUTE 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


O1FA 01FB LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
RING ABOVE AND ACUTE RING ABOVE AND ACUTE 

01FC 01FD LATIN CAPITAL LIGATURE AE LATIN SMALL LIGATURE AE WITH 
WITH ACUTE ACUTE 

O1FE O1FF LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH STROKE AND ACUTE STROKE AND ACUTE 

0200 0201 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
DOUBLE GRAVE DOUBLE GRAVE 

0202 0203 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
INVERTED BREVE INVERTED BREVE 

0204 0205 LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
DOUBLE GRAVE DOUBLE GRAVE 

0206 0207 LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
INVERTED BREVE INVERTED BREVE 

0208 0209 LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
DOUBLE GRAVE DOUBLE GRAVE 

020A 020B LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
INVERTED BREVE INVERTED BREVE 

020C 020D LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH DOUBLE GRAVE DOUBLE GRAVE 

020E 020F LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH INVERTED BREVE INVERTED BREVE 

0210 0211 LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH DOUBLE GRAVE DOUBLE GRAVE 

0212 0213 LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH INVERTED BREVE INVERTED BREVE 

0214 0215 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DOUBLE GRAVE DOUBLE GRAVE 

0216 0217 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH INVERTED BREVE INVERTED BREVE 

0386 03AC GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH TONOS WITH TONOS 

0388 03AD GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH TONOS WITH TONOS 

0389 O3AE GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH TONOS TONOS 

038A O3AF GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH TONOS TONOS 

038C 03CC GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH TONOS WITH TONOS 

038E 03CD GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH TONOS WITH TONOS 

038F 03CE GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH TONOS WITH TONOS 

0391 03B1 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA 

0392 03B2 GREEK CAPITAL LETTER BETA | GREEK SMALL LETTER BETA 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0393 03B3 GREEK CAPITAL LETTER GREEK SMALL LETTER GAMMA 
GAMMA 

0394 03B4 GREEK CAPITAL LETTER GREEK SMALL LETTER DELTA 
DELTA 

0395 03B5 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON 

0396 03B6 GREEK CAPITAL LETTER ZETA | GREEK SMALL LETTER ZETA 

0397 03B7 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA 

0398 03B8 GREEK CAPITAL LETTER GREEK SMALL LETTER THETA 
THETA 

0399 03B9 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA 

039A 03BA GREEK CAPITAL LETTER GREEK SMALL LETTER KAPPA 
KAPPA 

039B 03BB GREEK CAPITAL LETTER GREEK SMALL LETTER LAMDA 
LAMDA 

039C 03BC GREEK CAPITAL LETTER MU GREEK SMALL LETTER MU 

039D 03BD GREEK CAPITAL LETTER NU GREEK SMALL LETTER NU 

039E O3BE GREEK CAPITAL LETTER XI GREEK SMALL LETTER XI 

039F O3BF GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON 

03A0 03C0 GREEK CAPITAL LETTER PI GREEK SMALL LETTER PI 

03A1 03C1 GREEK CAPITAL LETTER RHO | GREEK SMALL LETTER RHO 

03A3 03C3 GREEK CAPITAL LETTER GREEK SMALL LETTER SIGMA 
SIGMA 

03A4 0304 GREEK CAPITAL LETTER TAU | GREEK SMALL LETTER TAU 

03A5 03C5 GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON 

03A6 03C6 GREEK CAPITAL LETTER PHI GREEK SMALL LETTER PHI 

03A7 03C7 GREEK CAPITAL LETTER CHI | GREEK SMALL LETTER CHI 

03A8 03C8 GREEK CAPITAL LETTER PSI GREEK SMALL LETTER PSI 

03A9 03C9 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA 

O3AA 03CA GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH DIALYTIKA DIALYTIKA 

03AB 03CB GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH DIALYTIKA WITH DIALYTIKA 

03E2 03E3 COPTIC CAPITAL LETTER SHEI | COPTIC SMALL LETTER SHEI 

03E4 03E5 COPTIC CAPITAL LETTER FEI |COPTIC SMALL LETTER FEI 

03E6 03E7 COPTIC CAPITAL LETTER KHEI | COPTIC SMALL LETTER KHEI 

03E8 03E9 COPTIC CAPITAL LETTER HORI | COPTIC SMALL LETTER HORI 

O3EA 03EB COPTIC CAPITAL LETTER COPTIC SMALL LETTER GANGIA 
GANGIA 

03EC 03ED COPTIC CAPITAL LETTER COPTIC SMALL LETTER SHIMA 


SHIMA 


28 iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


O3EE O3EF COPTIC CAPITAL LETTER DEI | COPTIC SMALL LETTER DEI 

0401 0451 CYRILLIC CAPITAL LETTER IO | CYRILLIC SMALL LETTER IO 

0402 0452 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER DJE 
DJE (SERBOCROATIAN) (SERBOCROATIAN) 

0403 0453 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER GJE 
GJE 

0404 0454 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
UKRAINIAN IE UKRAINIAN IE 

0405 0455 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER DZE 
DZE 

0406 0456 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
BYELORUSSIAN_UKRAINIAN |_| BYELORUSSIAN-UKRAINIAN | 

0407 0457 CYRILLIC CAPITAL LETTER YI | CYRILLIC SMALL LETTER YI 
(UKRANIAN) (UKRANIAN) 

0408 0458 CYRILLIC CAPITAL LETTER JE | CYRILLIC SMALL LETTER JE 

0409 0459 CYRILLIC CAPITAL LETTER LJE | CYRILLIC SMALL LETTER LJE 

040A 045A CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER NJE 
NJE 

040B 045B CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER TSHE 
TSHE (SERBOCROATIAN) (SERBOCROATIAN) 

040C 045C CYRILLIC CAPITAL LETTER KJE| CYRILLIC SMALL LETTER KJE 

040E 045E CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SHORT 
SHORT U (BYELORUSSIAN) U (BYELORUSSIAN) 

040F 045F CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER DZHE 
DZHE 

0410 0430 CYRILLIC CAPITAL LETTERA | CYRILLIC SMALL LETTER A 

0411 0431 CYRILLIC CAPITAL LETTER BE | CYRILLIC SMALL LETTER BE 

0412 0432 CYRILLIC CAPITAL LETTER VE | CYRILLIC SMALL LETTER VE 

0413 0433 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER GHE 
GHE 

0414 0434 CYRILLIC CAPITAL LETTER DE | CYRILLIC SMALL LETTER DE 

0415 0435 CYRILLIC CAPITAL LETTER IE | CYRILLIC SMALL LETTER IE 

0416 0436 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER ZHE 
ZHE 

0417 0437 CYRILLIC CAPITAL LETTER ZE | CYRILLIC SMALL LETTER ZE 

0418 0438 CYRILLIC CAPITALLETTER! | CYRILLIC SMALL LETTER | 

0419 0439 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SHORT | 
SHORT | 

041A 043A CYRILLIC CAPITAL LETTER KA | CYRILLIC SMALL LETTER KA 

041B 043B CYRILLIC CAPITAL LETTER EL | CYRILLIC SMALL LETTER EL 

041C 043C CYRILLIC CAPITAL LETTER EM | CYRILLIC SMALL LETTER EM 

041D 043D CYRILLIC CAPITAL LETTER EN | CYRILLIC SMALL LETTER EN 

O41E 043E CYRILLIC CAPITAL LETTER O | CYRILLIC SMALL LETTER O 

041F 043F CYRILLIC CAPITAL LETTER PE | CYRILLIC SMALL LETTER PE 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0420 0440 CYRILLIC CAPITAL LETTER ER | CYRILLIC SMALL LETTER ER 

0421 0441 CYRILLIC CAPITAL LETTER ES |CYRILLIC SMALL LETTER ES 

0422 0442 CYRILLIC CAPITAL LETTER TE | CYRILLIC SMALL LETTER TE 

0423 0443 CYRILLIC CAPITAL LETTER U- | CYRILLIC SMALL LETTER U 

0424 0444 CYRILLIC CAPITAL LETTER EF |CYRILLIC SMALL LETTER EF 

0425 0445 CYRILLIC CAPITAL LETTER HA | CYRILLIC SMALL LETTER HA 

0426 0446 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER TSE 
TSE 

0427 0447 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER CHE 
CHE 

0428 0448 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SHA 
SHA 

0429 0449 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SHCHA 
SHCHA 

042A 044A CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER HARD 
HARD SIGN SIGN 

042B 044B CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER YERU 
YERU 

042C 044C CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SOFT 
SOFT SIGN SIGN 

042D 044D CYRILLIC CAPITAL LETTER E  |CYRILLIC SMALL LETTER E 

042E 044E CYRILLIC CAPITAL LETTER YU | CYRILLIC SMALL LETTER YU 

042F 044F CYRILLIC CAPITAL LETTER YA |CYRILLIC SMALL LETTER YA 

0460 0461 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER OMEGA 
OMEGA 

0462 0463 CYRILLIC CAPITAL LETTER YAT | CYRILLIC SMALL LETTER YAT 

0464 0465 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
IOTIFIED E IOTIFIED E 

0466 0467 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER LITTLE 
LITTLE YUS YUS 

0468 0469 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER IOTFIED 
IOTIFIED LITTLE YUS LITTLE YUS 

046A 046B CYRILLIC CAPITAL LETTER BIG | CYRILLIC SMALL LETTER BIG YUS 
YUS 

046C 046D CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
IOTIFIED BIG YUS IOTIFIED BIG YUS 

046E 046F CYRILLIC CAPITAL LETTER KSI | CYRILLIC SMALL LETTER KSI 

0470 0471 CYRILLIC CAPITAL LETTER PSI | CYRILLIC SMALL LETTER PSI 

0472 0473 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER FITA 
FITA 

0474 0475 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER IZHITSA 
IZHITSA 

0476 0477 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER IZHITSA 


IZHITSA WITH DOUBLE GRAVE 
ACCENT 


WITH DOUBLE GRAVE ACCENT 


30 = iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0478 0479 CYRILLIC CAPITAL LETTER UK | CYRILLIC SMALL LETTER UK 

047A 047B CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER ROUND 
ROUND OMEGA OMEGA 

047C 047D CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER OMEGA 
OMEGA WITH TITLO WITH TITLO 

047E 047F CYRILLIC CAPITAL LETTER OT | CYRILLIC SMALL LETTER OT 

0480 0481 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER KOPPA 
KOPPA 

0490 0491 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER GHE 
GHE WITH UPTURN WITH UPTURN 

0492 0493 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER GHE 
GHE WITH STROKE WITH STROKE 

0494 0495 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER GHE 
GHE WITH MIDDLE HOOK WITH MIDDLE HOOK 

0496 0497 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER ZHE 
ZHE WITH DESCENDER WITH DESCENDER 

0498 0499 CYRILLIC CAPITAL LETTER ZE | CYRILLIC SMALL LETTER ZE WITH 
WITH DESCENDER DESCENDER 

049A 049B CYRILLIC CAPITAL LETTER KA | CYRILLIC SMALL LETTER KA WITH 
WITH DESCENDER DESCENDER 

049C 049D CYRILLIC CAPITAL LETTER KA | CYRILLIC SMALL LETTER KA WITH 
WITH VERTICAL STROKE VERTICAL STROKE 

049E 049F CYRILLIC CAPITAL LETTER KA | CYRILLIC SMALL LETTER KA WITH 
WITH STROKE STROKE 

04A0 04A1 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
BASHKIR KA EASHKIR KA 

04A2 04A3 CYRILLIC CAPITAL LETTER EN | CYRILLIC SMALL LETTER EN 
WITH DESCENDER WITH DESCENDER 

04A4 04A5 CYRILLIC CAPITAL LIGATURE | CYRILLIC SMALL LIGATURE EN 
EN GHF GHE 

04A6 04A7 CYRILLIC CAPITAL LETTER PE | CYRILLIC SMALL LETTER PE 
WITH MIDDLE HOOK WITH MIDDLE HOOK (ABKHASIAN) 
(ABKHASIAN) 

04A8 04A9 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
ABKHASIAN HA ABKHASIAN HA 

O4AA 04AB CYRILLIC CAPITAL LETTER ES | CYRILLIC SMALL LETTER ES 
WITH DESCENDER WITH DESCENDER 

04AC 04AD CYRILLIC CAPITAL LETTER TE | CYRILLIC SMALL LETTER TE WITH 
WITH DESCENDER DESCENDER 

O4AE O4AF CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
STRAIGHT U STRAIGHT U 

04B0 04B1 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
STRAIGHT U WITH STROKE | STRAIGHT U WITH STROKE 

04B2 04B3 CYRILLIC CAPITAL LETTER HA | CYRILLIC SMALL LETTER HA 
WITH DESCENDER WITH DESCENDER 

04B4 04B5 CYRILLIC CAPITAL LIGATURE | CYRILLIC SMALL LIGATURE TE 


TE TSE (ABKHASIAN) 


TSE (ABKHASIAN) 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


04B6 04B7 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER CHE 
CHE WITH DESCENDER WITH DESCENDER 

04B8 04B9 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER CHE 
CHE WITH VERTICAL STROKE | WITH VERTICAL STROKE 

04BA 04BB CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SHHA 
SHHA 

04BC 04BD CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
ABKHASIAN CHE ABKHASIAN CHE 

04BE 04BF CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
ABKHASIAN CHE WITH ABKHASIAN CHE WITH 
DESCENDER DESCENDER 

04C1 04C2 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER ZHE 
ZHE WITH BREVE WITH BREVE 

04C3 0404 CYRILLIC CAPITAL LETTER KA | CYRILLIC SMALL LETTER KA WITH 
WITH HOOK HOOK 

04C7 04C8 CYRILLIC CAPITAL LETTER EN | CYRILLIC SMALL LETTER EN 
WITH HOOK WITH HOOK 

04CB 04CC CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
KHAKASSIAN CHE KHAKASSIAN CHE 

04D0 04D1 CYRILLIC CAPITALLETTERA = |CYRILLIC SMALL LETTER A WITH 
WITH BREVE BREVE 

04D2 04D3 CYRILLIC CAPITAL LETTERA = |CYRILLIC SMALL LETTER A WITH 
WITH DIAERESIS DIAERESIS 

04D4 04D5 CYRILLIC CAPITAL LIGATURE A | CYRILLIC SMALL LIGATURE A IE 
IE 

04D6 04D7 CYRILLIC CAPITAL LETTER IE | CYRILLIC SMALL LETTER IE WITH 
WITH BREVE BREVE 

04D8 04D9 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SCHWA 
SCHWA 

04DA 04DB CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER SCHWA 
SCHWA WITH DIAERESIS WITH DIAERESIS 

04DC 04DD CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER ZHE 
ZHE WITH DIAERESIS WITH DIAERESIS 

04DE 04DF CYRILLIC CAPITAL LETTER ZE |CYRILLIC SMALL LETTER ZE WITH 
WITH DIAERESIS DIAERESIS 

04E0 04E1 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER 
ABKHASIAN DZE ABKHASIAN DZE 

04E2 04E3 CYRILLIC CAPITAL LETTER | CYRILLIC SMALL LETTER | WITH 
WITH MACRON MACRON 

04E4 04E5 CYRILLIC CAPITAL LETTER | CYRILLIC SMALL LETTER | WITH 
WITH DIAERESIS DIAERESIS 

04E6 04E7 CYRILLIC CAPITAL LETTER O- | CYRILLIC SMALL LETTER O WITH 
WITH DIAERESIS DIAERESIS 

04E8 04E9 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER BARRED 
BARRED O O 

O04EA 04EB CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER BARRED 


BARRED O WITH DIAERESIS 


0 WITH DIAERESIS 


32 iSeries: Globalization (Handle data in global applications) 


Uppercase code 


Lowercase code 


Uppercase character 


point point description Lowercase character description 

O4EE O4EF CYRILLIC CAPITAL LETTER U CYRILLIC SMALL LETTER U WITH 
WITH MACRON MACRON 

04F0O O4F 1 CYRILLIC CAPITAL LETTER U CYRILLIC SMALL LETTER U WITH 
WITH DIAERESIS DIAERESIS 

04F2 04F3 CYRILLIC CAPITAL LETTER U CYRILLIC SMALL LETTER U WITH 
WITH DOUBLE ACUTE DOUBLE ACUTE 

04F4 04F5 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER CHE 
CHE WITH DIAERESIS WITH DIAERESIS 

04F8 04F9 CYRILLIC CAPITAL LETTER CYRILLIC SMALL LETTER YERU 
YERU WITH DIAERESIS WITH DIAERESIS 

0531 0561 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER AYB 
AYB 

0532 0562 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER BEN 
BEN 

0533 0563 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER GIM 
GIM 

0534 0564 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER DA 
DA 

0535 0565 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER ECH 
ECH 

0536 0566 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER ZA 
ZA 

0537 0567 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER EH 
EH 

0538 0568 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER ET 
ET 

0539 0569 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER TO 
TO 

053A 056A ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER ZHE 
ZHE 

053B 056B ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER INI 
INI 

053C 056C ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER LIWN 
LIWN 

053D 056D ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER XEH 
XEH 

053E 056E ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER CA 
CA 

053F O56F ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER KEN 
KEN 

0540 0570 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER HO 
HO 

0541 0571 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER JA 
JA 

0542 0572 ARMENIAN CAPITAL LETTER ARMENIAN SMALL LETTER GHAD 


GHAD 


OS/400 globalization 33 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


0543 0573 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER CHEH 
CHEH 

0544 0574 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER MEN 
MEN 

0545 0575 ARMENIAN CAPITAL LETTER YI | ARMENIAN SMALL LETTER Y! 

0546 0576 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER NOW 
NOW 

0547 0577 ARMENIAN CAPITAL LETTER |ARMENIAN SMALL LETTER SNA 
SHA 

0548 0578 ARMENIAN CAPITAL LETTER = |ARMENIAN SMALL LETTER VO 
vo 

0549 0579 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER CHA 
CHA 

054A 057A ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER PEH 
PEH 

054B 057B ARMENIAN CAPITAL LETTER = |ARMENIAN SMALL LETTER JHEH 
JHEH 

054C 057C ARMENIAN CAPITAL LETTER =| ARMENIAN SMALL LETTER RA 
RA 

054D 057D ARMENIAN CAPITAL LETTER |ARMENIAN SMALL LETTER SEH 
SEH 

054E 057E ARMENIAN CAPITAL LETTER = |ARMENIAN SMALL LETTER VEW 
VEW 

054F 057F ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER TIWN 
TIWN 

0550 0580 ARMENIAN CAPITAL LETTER |ARMENIAN SMALL LETTER REH 
REH 

0551 0581 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER CO 
co 

0552 0582 ARMENIAN CAPITAL LETTER ~|ARMENIAN SMALL LETTER YIWN 
YIWN 

0553 0583 ARMENIAN CAPITAL LETTER = |ARMENIAN SMALL LETTER PIWP 
PIWR 

0554 0584 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER KEH 
KEH 

0555 0585 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER OH 
OH 

0556 0586 ARMENIAN CAPITAL LETTER | ARMENIAN SMALL LETTER FEH 
FEH 

10A0 10D0 GEORGIAN CAPITAL LETTER |GEORGIAN LETTER AN 
AN (KHUTSURI) 

10A1 10D1 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER BAN 
BAN (KHUTSURI) 

10A2 10D2 GEORGIAN CAPITALLETTER |GEORGIAN LETTER GAN 
GAN (KHUTSURI) 

10A3 10D3 GEORGIAN CAPITALLETTER |GEORGIAN LETTER DON 


DON (KHUTSURI) 


34 iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


10A4 10D4 GEORGIAN CAPITALLETTER | GEORGIAN LETTER EN 
EN (KHUTSURI) 

10A5 10D5 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER VIN 
VIN (KHUTSURI) 

10A6 10D6 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER ZEN 
ZEN (KHUTSURI) 

10A7 10D7 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER TAN 
TAN (KHUTSURI) 

10A8 10D8 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER IN 
IN (KHUTSURI) 

10A9 10D9 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER KAN 
KAN (KHUTSURI) 

10AA 10DA GEORGIAN CAPITAL LETTER |GEORGIAN LETTER LAS 
LAS (KHUTSURI) 

10AB 10DB GEORGIAN CAPITALLETTER |GEORGIAN LETTER MAN 
MAN (KHUTSURI) 

10AC 10DC GEORGIAN CAPITALLETTER | GEORGIAN LETTER NAR 
NAR (KHUTSURI) 

10AD 10DD GEORGIAN CAPITALLETTER | GEORGIAN LETTER ON 
ON (KHUTSURI) 

10AE 10DE GEORGIAN CAPITALLETTER |GEORGIAN LETTER PAR 
PAR (KHUTSURI) 

10AF 10DF GEORGIAN CAPITALLETTER |GEORGIAN LETTER ZHAR 
ZHAR (KHUTSURI) 

10B0 10E0 GEORGIAN CAPITAL LETTER | GEORGIAN LETTER RAE 
RAE (KHUTSURI) 

10B1 10E1 GEORGIAN CAPITAL LETTER |GEORGIAN LETTER SAN 
SAN (KHUTSURI) 

10B2 10E2 GEORGIAN CAPITAL LETTER |GEORGIAN LETTER TAR 
TAR (KHUTSURI) 

10B3 10E3 GEORGIAN CAPITALLETTER |GEORGIAN LETTER UN 
UN (KHUTSURI) 

10B4 10E4 GEORGIAN CAPITALLETTER |GEORGIAN LETTER PHAR 
PHAR (KHUTSURI) 

10B5 10E5 GEORGIAN CAPITALLETTER |GEORGIAN LETTER KHAR 
KHAR (KHUTSURI) 

10B6 10E6 GEORGIAN CAPITALLETTER |GEORGIAN LETTER GHAN 
GHAN (KHUTSURI) 

10B7 10E7 GEORGIAN CAPITALLETTER |GEORGIAN LETTER QAR 
QAR (KHUTSURI) 

10B8 10E8 GEORGIAN CAPITAL LETTER |GEORGIAN LETTER SHIN 
SHIN (KHUTSURI) 

10B9 10E9 GEORGIAN CAPITAL LETTER |GEORGIAN LETTER CHIN 
CHIN (KHUTSURI) 

10BA 10EA GEORGIAN CAPITAL LETTER |GEORGIAN LETTER CAN 


CAN (KHUTSURI) 


OS/400 globalization 


35 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


10BB 10EB GEORGIAN CAPITAL LETTER | GEORGIAN LETTER JIL 
JIL (KHUTSURI) 

10BC 10EC GEORGIAN CAPITAL LETTER | GEORGIAN LETTER CIL 
CIL (KHUTSURI) 

10BD 10ED GEORGIAN CAPITALLETTER |GEORGIAN LETTER CHAR 
CHAR (KHUTSURI) 

10BE 10EE GEORGIAN CAPITALLETTER |GEORGIAN LETTER XAN 
XAN (KHUTSURI) 

10BF 10EF GEORGIAN CAPITALLETTER | GEORGIAN LETTER JHAN 
JHAN (KHUTSURI) 

10C0 10FO GEORGIAN CAPITALLETTER | GEORGIAN LETTER HAE 
HAE (KHUTSURI) 

10C1 10F1 GEORGIAN CAPITALLETTER | GEORGIAN LETTER HE 
HE (KHUTSURI) 

10C2 10F2 GEORGIAN CAPITALLETTER | GEORGIAN LETTER HIE 
HIE (KHUTSURI) 

10C3 10F3 GEORGIAN CAPITALLETTER | GEORGIAN LETTER WE 
WE (KHUTSURI) 

10C4 10F4 GEORGIAN CAPITALLETTER |GEORGIAN LETTER HAR 
HAR (KHUTSURI) 

10C5 10F5 GEORGIAN CAPITALLETTER |GEORGIAN LETTER HOE 
HOE (KHUTSURI) 

1E00 1E01 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
RING BELOW RING BELOW 

1E02 1E03 LATIN CAPITAL LETTER B WITH| LATIN SMALL LETTER B WITH 
DOT ABOVE DOT ABOVE 

1E04 1E05 LATIN CAPITAL LETTER B WITH| LATIN SMALL LETTER B WITH 
DOT BELOW DOT BELOW 

1E06 1E07 LATIN CAPITAL LETTER B WITH | LATIN SMALL LETTER B WITH 
LINE BELOW LINE BELOW 

1E08 1E09 LATIN CAPITAL LETTER C LATIN SMALL LETTER C WITH 
WITH CEDILLA AND ACUTE CEDILLA AND ACUTE 

1E0A 1E0B LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH DOT ABOVE DOT ABOVE 

1E0C 1E0D LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH DOT BELOW DOT BELOW 

1E0E 1E0F LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH LINE BELOW LINE BELOW 

1E10 1E11 LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH CEDILLA CEDILLA 

1E12 1E13 LATIN CAPITAL LETTER D LATIN SMALL LETTER D WITH 
WITH CIRCUMFLEX BELOW __| CIRCUMFLEX BELOW 

1E14 1E15 LATIN CAPITAL LETTER E WITH| LATIN SMALL LETTER E WITH 
MACRON AND GRAVE MACRON AND GRAVE 

1E16 1E17 LATIN CAPITAL LETTER E WITH| LATIN SMALL LETTER E WITH 


MACRON AND ACUTE 


MACRON AND ACUTE 


36 iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1E18 1E19 LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E1A 1E1B LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
TILDE BELOW TILDE BELOW 

1E1C 1E1D LATIN CAPITAL LETTER E WITH | LATIN SMALL LETTER E WITH 
CEDILLA AND BREVE CEDILLA AND BREVE 

1E1E 1E1F LATIN CAPITAL LETTER F WITH | LATIN SMALL LETTER F WITH 
DOT ABOVE DOT ABOVE 

1E20 1E21 LATIN CAPITAL LETTER G LATIN SMALL LETTER G WITH 
WITH MACRON MACRON 

1E22 1E23 LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH DOT ABOVE DOT ABOVE 

1E24 1E25 LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH DOT BELOW DOT BELOW 

1E26 1E27 LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH DIAERESIS DIAERESIS 

1E28 1E29 LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH CEDILLA CEDILLA 

1E2A 1E2B LATIN CAPITAL LETTER H LATIN SMALL LETTER H WITH 
WITH BREVE BELOW BREVE BELOW 

1E2C 1E2D LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
TILDE BELOW TILDE BELOW 

1E2E 1E2F LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
DIAERESIS AND ACUTE DIAERESIS AND ACUTE 

1E30 1E31 LATIN CAPITAL LETTER K WITH | LATIN SMALL LETTER K WITH 
ACUTE ACUTE 

1E32 1E33 LATIN CAPITAL LETTER K WITH | LATIN SMALL LETTER K WITH 
DOT BELOW DOT BELOW 

1E34 1E35 LATIN CAPITAL LETTER K WITH | LATIN SMALL LETTER K WITH 
LINE BELOW LINE BELOW 

1E36 1E37 LATIN CAPITAL LETTER L WITH |LATIN SMALL LETTER L WITH DOT 
DOT BELOW BELOW 

1E38 1E39 LATIN CAPITAL LETTER L WITH | LATIN SMALL LETTER L WITH DOT 
DOT BELOW AND MACRON BELOW AND MACRON 

1E3A 1E3B LATIN CAPITAL LETTER L WITH |LATIN SMALL LETTER L WITH 
LINE BELOW LINE BELOW 

1E3C 1E3D LATIN CAPITAL LETTER L WITH |LATIN SMALL LETTER L WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E3E 1E3F LATIN CAPITAL LETTER M LATIN SMALL LETTER M WITH 
WITH ACUTE ACUTE 

1E40 1E41 LATIN CAPITAL LETTER M LATIN SMALL LETTER M WITH 
WITH DOT ABOVE DOT ABOVE 

1E42 1E43 LATIN CAPITAL LETTER M LATIN SMALL LETTER M WITH 
WITH DOT BELOW DOT BELOW 

1E44 1E45 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 


WITH DOT ABOVE 


DOT ABOVE 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1E46 1E47 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH DOT BELOW DOT BELOW 

1E48 1E49 LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH LINE BELOW LINE BELOW 

1E4A 1E4B LATIN CAPITAL LETTER N LATIN SMALL LETTER N WITH 
WITH CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E4C 1E4D LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH TILDE AND ACUTE TILDE AND ACUTE 

1E4E 1E4F LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH TILDE AND DIAERESIS TILDE AND DIAERESIS 

1E50 1E51 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH MACRON AND GRAVE MACRON AND GRAVE 

1E52 1E53 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH MACRON AND ACUTE MACRON AND ACUTE 

1E54 1E55 LATIN CAPITAL LETTER P WITH | LATIN SMALL LETTER P WITH 
ACUTE ACUTE 

1E56 1E57 LATIN CAPITAL LETTER P WITH |LATIN SMALL LETTER P WITH 
DOT ABOVE DOT ABOVE 

1E58 1E59 LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH DOT ABOVE DOT ABOVE 

1E5A 1E5B LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH DOT BELOW DOT BELOW 

1E5C 1E5D LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH DOT BELOW AND DOT BELOW AND MACRON 
MACRON 

1E5E 1E5F LATIN CAPITAL LETTER R LATIN SMALL LETTER R WITH 
WITH LINE BELOW LINE BELOW 

1E60 1E61 LATIN CAPITAL LETTER S WITH} LATIN SMALL LETTER S WITH 


DOT ABOVE 


DOT ABOVE 


1E62 1E63 LATIN CAPITAL LETTER S WITH | LATIN SMALL LETTER S WITH 
DOT BELOW DOT BELOW 

1E64 1E65 LATIN CAPITAL LETTER S WITH|LATIN SMALL LETTER S WITH 
ACUTE AND DOT ABOVE ACUTE AND DOT ABOVE 

1E66 1E67 LATIN CAPITAL LETTER S WITH| LATIN SMALL LETTER S WITH 
CARON AND DOT ABOVE CARON AND DOT ABOVE 

1E68 1E69 LATIN CAPITAL LETTER S WITH| LATIN SMALL LETTER S WITH 
DOT BELOW AND DOT ABOVE |DOT BELOW AND DOT ABOVE 

1E6A 1E6B LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH DOT 
DOT ABOVE ABOVE 

1E6C 1E6D LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH DOT 
DOT BELOW BELOW 

1E6E 1E6F LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 
LINE BELOW LINE BELOW 

1E70 1E71 LATIN CAPITAL LETTER T WITH | LATIN SMALL LETTER T WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E72 1E73 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 


WITH DIAERESIS BELOW 


DIAERESIS BELOW 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1E74 1E75 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH TILDE BELOW TILDE BELOW 

1E76 1E77 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E78 1E79 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH TILDE AND ACUTE TILDE AND ACUTE 

1E7A 1E7B LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH MACRON AND MACRON AND DIAERESIS 
DIAERESIS 

1E7C 1E7D LATIN CAPITAL LETTER V WITH | LATIN SMALL LETTER V WITH 
TILDE TILDE 

1E7E 1E7F LATIN CAPITAL LETTER V WITH} LATIN SMALL LETTER V WITH 
DOT BELOW DOT BELOW 

1E80 1E81 LATIN CAPITAL LETTER W LATIN SMALL LETTER W WITH 
WITH GRAVE GRAVE 

1E82 1E83 LATIN CAPITAL LETTER W LATIN SMALL LETTER W WITH 
WITH ACUTE ACUTE 

1E84 1E85 LATIN CAPITAL LETTER W LATIN SMALL LETTER W WITH 
WITH DIAERESIS DIAERESIS 

1E86 1E87 LATIN CAPITAL LETTER W LATIN SMALL LETTER W WITH 
WITH DOT ABOVE DOT ABOVE 

1E88 1E89 LATIN CAPITAL LETTER W LATIN SMALL LETTER W WITH 
WITH DOT BELOW DOT BELOW 

1E8A 1E8B LATIN CAPITAL LETTER X WITH} LATIN SMALL LETTER X WITH 
DOT ABOVE DOT ABOVE 

1E8C 1E8D LATIN CAPITAL LETTER X5 LATIN SMALL LETTER X WITH 
WITH DIAERESIS DIAERESIS 

1E8E 1E8F LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
DOT ABOVE DOT ABOVE 

1E90 1E91 LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
CIRCUMFLEX CIRCUMFLEX 

1E92 1E93 LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
DOT BELOW DOT BELOW 

1E94 1E95 LATIN CAPITAL LETTER Z WITH | LATIN SMALL LETTER Z WITH 
LINE BELOW LINE BELOW 

1EA0O 1EA1 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH DOT 
DOT BELOW BELOW 

1EA2 1EA3 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
HOOK ABOVE HOOK ABOVE 

1EA4 1EA5 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
CIRCUMFLEX AND ACUTE CIRCUMFLEX AND ACUTE 

1EA6 1EA7 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
CIRCUMFLEX AND GRAVE CIRCUMFLEX AND GRAVE 

1EA8 1EA9 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
CIRCUMFLEX AND HOOK CIRCUMFLEX AND HOOK ABOVE 
ABOVE 

1EAA 1EAB LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 


CIRCUMFLEX AND TILDE 


CIRCUMFLEX AND TILDE 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1EAC 1EAD LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
CIRCUMFLEX AND DOT CIRCUMFLEX AND DOT BELOW 
BELOW 

1EAE 1EAF LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
BREVE AND ACUTE BREVE AND ACUTE 

1EBO 1EB1 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
BREVE AND GRAVE BREVE AND GRAVE 

1EB2 1EB3 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
BREVE AND HOOK ABOVE BREVE AND HOOK ABOVE 

1EB4 1EB5 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
BREVE AND TILDE BREVE AND TILDE 

1EB6 1EB7 LATIN CAPITAL LETTER A WITH | LATIN SMALL LETTER A WITH 
BREVE AND DOT BELOW BREVE AND DOT BELOW 

1EB8 1EB9 LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
DOT BELOW DOT BELOW 

1EBA 1EBB LATIN CAPITAL LETTER E WITH | LATIN SMALL LETTER E WITH 
HOOK ABOVE HOOK ABOVE 

1EBC 1EBD LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
TILDE TILDE 

1EBE 1EBF LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 


CIRCUMFLEX AND ACUTE 


CIRCUMFLEX AND ACUTE 


1ECO 1EC1 LATIN CAPITAL LETTER E WITH|LATIN SMALL LETTER E WITH 
CIRCUMFLEX AND GRAVE CIRCUMFLEX AND GRAVE 

1EC2 1EC3 LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
CIRCUMFLEX AND HOOK CIRCUMFLEX AND HOOK ABOVE 
ABOVE 

1EC4 1EC5 LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
CIRCUMFLEX AND TILDE CIRCUMFLEX AND TILDE 

1EC6 1EC7 LATIN CAPITAL LETTER E WITH |LATIN SMALL LETTER E WITH 
CIRCUMFLEX AND DOT CIRCUMFLEX AND DOT BELOW 
BELOW 

1EC8 1EC9 LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH 
HOOK ABOVE HOOK ABOVE 

1ECA 1ECB LATIN CAPITAL LETTER | WITH |LATIN SMALL LETTER | WITH DOT 
DOT BELOW BELOW 

1ECC 1ECD LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH DOT BELOW DOT BELOW 

1ECE 1ECF LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HOOK ABOVE HOOK ABOVE 

1EDO 1ED1 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH CIRCUMFLEX AND CIRCUMFLEX AND ACUTE 
ACUTE 

1ED2 1ED3 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH CIRCUMFLEX AND CIRCUMFLEX AND GRAVE 
GRAVE 

1ED4 1ED5 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 


WITH CIRCUMFLEX AND HOOK 
ABOVE 


CIRCUMFLEX AND HOOK ABOVE 


40 iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1ED6 1ED7 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH CIRCUMFLEX AND TILDE | CIRCUMFLEX AND TILDE 

1ED8 1ED9 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH CIRCUMFLEX AND DOT | CIRCUMFLEX AND DOT BELOW 
BELOW 

1EDA 1EDB LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HORN AND ACUTE HORN AND ACUTE 

1EDC 1EDD LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HORN AND GRAVE HORN AND GRAVE 

1EDE 1EDF LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HORN AND HOOK ABOVE | HORN AND HOOK ABOVE 

1EE0O 1EE1 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HORN AND TILDE HORN AND TILDE 

1EE2 1EE3 LATIN CAPITAL LETTER O LATIN SMALL LETTER O WITH 
WITH HORN AND DOT BELOW _| HORN AND DOT BELOW 

1EE4 1EE5 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH DOT BELOW DOT BELOW 

1EE6 1EE7 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HOOK ABOVE HOOK ABOVE 

1EE8 1EE9 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HORN AND ACUTE HORN AND ACUTE 

1EEA 1EEB LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HORN AND GRAVE HORN AND GRAVE 

1EEC 1EED LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HORN AND HOOK ABOVE | HORN AND HOOK ABOVE 

1EEE 1EEF LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HORN AND TILDE HORN AND TILDE 

1EFO 1EF1 LATIN CAPITAL LETTER U LATIN SMALL LETTER U WITH 
WITH HORN AND DOT BELOW _| HORN AND DOT BELOW 

1EF2 1EF3 LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
GRAVE GRAVE 

1EF4 1EF5 LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
DOT BELOW DOT BELOW 

1EF6 1EF7 LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
HOOK ABOVE HOOK ABOVE 

1EF8 1EF9 LATIN CAPITAL LETTER Y WITH | LATIN SMALL LETTER Y WITH 
TILDE TILDE 

1F08 1F00 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI WITH PSILI 

1F09 1F01 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA WITH DASIA 

1FOA 1F02 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI AND VARIA | WITH PSILI AND VARIA 

1FOB 1F03 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND VARIA | WITH DASIA AND VARIA 

1FOC 1F04 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 


ALPHA WITH PSILI AND OXIA 


WITH PSILI AND OXIA 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1FOD 1F05 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND OXIA | WITH DASIA AND OXIA 

1FOE 1F06 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI AND WITH PSILI AND PERISPOMENI 
PERISPOMENI 

1FOF 1FO07 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND WITH DASIA AND PERISPOMENI 
PERISPOMENI 

1F18 1F10 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH PSILI WITH PSILI 

1F19 1F11 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH DASIA WITH DASIA 

1FIA 1F12 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH PSILI AND WITH PSILI AND VARIA 
VARIA 

1F1B 1F13 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH DASIA AND WITH DASIA AND VARIA 
VARIA 

1F1C 1F14 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH PSILI AND OXIA | WITH PSILI AND OXIA 

1F1D 1F15 GREEK CAPITAL LETTER GREEK SMALL LETTER EPSILON 
EPSILON WITH DASIA AND WITH DASIA AND OXIA 
OXIA 

1F28 1F20 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI PSILI 

1F29 1F21 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA DASIA 

1F2A 1F22 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND VARIA PSILI AND VARIA 

1F2B 1F23 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA AND VARIA DASIA AND VARIA 

1F2C 1F24 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND OXIA PSILI AND OXIA 

1F2D 1F25 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA AND OXIA DASIA AND OXIA 

1F2E 1F26 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND PERISPOMENI | PSILI AND PERISPOMENI 

1F2F 1F27 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA AND DASIA AND PERISPOMENI 
PERISPOMENI 

1F38 1F30 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH PSILI PSILI 

1F39 1F31 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH DASIA DASIA 

1F3A 1F32 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 


1F3B 


1F33 


WITH PSILI AND VARIA 


GREEK CAPITAL LETTER IOTA 
WITH DASIA AND VARIA 


PSILI AND VARIA 


GREEK SMALL LETTER IOTA WITH 
DASIA AND VARIA 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1F3C 1F34 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH PSILI AND OXIA PSILI AND OXIA 

1F3D 1F35 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH DASIA AND OXIA DASIA AND OXIA 

1F3E 1F36 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH PSILI AND PERISPOMENI | PSILI AND PERISPOMENI 

1F3F 1F37 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH DASIA AND DASIA AND PERISPOMENI 
PERISPOMENI 

1F48 1F40 GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH PSILI WITH PSILI 

1F49 1F41 GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH DASIA WITH DASIA 

1F4A 1F42 GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH PSILI AND WITH PSILI AND VARIA 
VARIA 

1F4B 1F43 GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH DASIA AND WITH DASIA AND VARIA 
VARIA 

1F4C 1F44 GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH PSILI AND WITH PSILI AND OXIA 
OXIA 

1F4D 1F45 GREEK CAPITAL LETTER GREEK SMALL LETTER OMICRON 
OMICRON WITH DASIA AND WITH DASIA AND OXIA 
OXIA 

1F59 1F51 GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH OASIS WITH DASIA 

1F5B 1F53 GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH DASIA AND WITH DASIA AND VARIA 
VARIA 

1F5D 1F55 GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH DASIA AND WITH DASIA AND OXIA 
OXIA 

1F5F 1F57 GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH DASIA AND WITH DASIA AND PERISPOMENI 
PERISPOMENI 

1F68 1F60 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI WITH PSILI 

1F69 1F61 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH DASIA WITH DASIA 

1F6A 1F62 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND VARIA | WITH PSILI AND VARIA 

1F6B 1F63 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH DASIA AND WITH DASIA AND VARIA 
VARIA 

1F6C 1F64 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND OXIA | WITH PSILI AND OXIA 

1F6D 1F65 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 


OMEGA WITH DASIA AND OXIA 


WITH DASIA AND OXIA 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1F6E 1F66 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND WITH PSILI AND PERISPOMENI 
PERISPOMENI 

1F6F 1F67 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH DASIA AND WITH DASIA AND PERISPOMENI 
PERISPOMENI 

1F88 1F80 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI AND WITH PSILI AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1F89 1F81 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND WITH DASIA AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1F8A 1F82 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI AND VARIA | WITH PSILI AND VARIA AND 
AND PROSGEGRAMMENI YPOGEGRAMMENI 

1F8B 1F83 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND VARIA | WITH DASIA AND VARIA AND 
AND PROSGEGRAMMENI YPOGEGRAMMENI 

1F8C 1F84 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI AND OXIA — | WITH PSILI AND OXIA AND 
AND PROSGEGRAMMEN YPOGEGRAMMENI 

1F8D 1F85 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND OXIA | WITH DASIA AND OXIA AND 
AND PROSGEGRAMMEN YPOGEGRAMMENI 

1F8E 1F86 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH PSILI AND WITH PSILI AND PERISPOMENI 
PERISPOMENI AND AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1F8F 1F87 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH DASIA AND WITH DASIA AND PERISPOMENI 
PERISPOMENI AND AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1F98 1F90 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND PSILI AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1F99 1F91 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA AND DASIA AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1F9A 1F92 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND VARIA AND PSILI AND VARIA AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1F9B 1F93 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA AND VARIA AND DASIA AND VARIA AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1F9C 1F94 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND OXIA AND PSILI AND OXIA AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1F9D 1F95 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 


WITH DASIA AND OXIA AND 
PROSGEGRAMMENI 


DASIA AND OXIA AND 
YPOGEGRAMMENI 


44 iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


1F9E 1F96 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH PSILI AND PERISPOMENI | PSILI AND PERISPOMENI AND 
AND PROSGEGRAMMENI YPOGEGRAMMENI 

1FOF 1F97 GREEK CAPITAL LETTER ETA | GREEK SMALL LETTER ETA WITH 
WITH DASIA AND DASIA AND PERISPOMENI AND 
PERISPOMENI AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1FA8 1FAO GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND WITH PSILI AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1FA9 1FA1 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH DASIA AND WITH DASIA AND 
PROSGEGRAMMENI YPOGEGRAMMENI 

1FAA 1FA2 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND VARIA | WITH PSILI AND VARIA AND 
AND PROSGEGRAMMENI YPOGEGRAMMENI 

1FAB 1FA3 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH DASIA AND WITH DASIA AND VARIA AND 
VARIA AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1FAC 1FA4 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND OXIA | WITH PSILI AND OXIA AND 
AND PROSGEGRAMMENI YPOGEGRAMMENI 

1FAD 1FA5 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH DASIA AND OXIA | WITH DASIA AND OXIA AND 
AND PROSGEGRAMMENI YPOGEGRAMMENI 

1FAE 1FA6 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMEGA WITH PSILI AND WITH PSILI AND PERISPOMENI 
PERISPOMENI AND AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1FAF 1FA7 GREEK CAPITAL LETTER GREEK SMALL LETTER OMEGA 
OMECA WITH DASIA AND WITH DASIA AND PEPISPOMENI 
PERISPOMENI AND AND YPOGEGRAMMENI 
PROSGEGRAMMENI 

1FB8 1FBO GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH VRACHY WITH VRACHY 

1FB9 1FB1 GREEK CAPITAL LETTER GREEK SMALL LETTER ALPHA 
ALPHA WITH MACRON WITH MACRON 

1FD8 1FDO GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH VRACHY VRACHY 

1FD9 1FD1 GREEK CAPITAL LETTER IOTA | GREEK SMALL LETTER IOTA WITH 
WITH MACRON MACRON 

1FE8 1FEO GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH VRACHY WITH VRACHY 

1FE9 1FE1 GREEK CAPITAL LETTER GREEK SMALL LETTER UPSILON 
UPSILON WITH MACRON WITH MACRON 

24B6 24D0 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER A 
LETTERA 

24B7 24D1 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER B 


LETTER B 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


24B8 24D2 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER C 
LETTER C 

24B9 24D3 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER D 
LETTER D 

24BA 24D4 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER E 
LETTER E 

24BB 24D5 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER F 
LETTER F 

24BC 24D6 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER G 
LETTER G 

24BD 24D7 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER H 
LETTER H 

24BE 24D8 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER | 
LETTER | 

24BF 24D9 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER J 
LETTER J 

24C0 24DA CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER K 
LETTER K 

24C1 24DB CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER L 
LETTER L 

24C2 24DC CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER M 
LETTER M 

2403 24DD CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER N 
LETTER N 

2404 24DE CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER O 
LETTER O 

24C5 24DF CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER P 
LETTER P 

24C6 24E0 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER Q 
LETTER Q 

24C7 24E1 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER R 
LETTER R 

24C8 24E2 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER S 
LETTER S 

24C9 24E3 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER T 
LETTER T 

24CA 24E4 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER U 
LETTER U 

24CB 24E5 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER V 
LETTER V 

24CC 24E6 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER W 
LETTER W 

24CD 24E7 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER X 
LETTER X 

24CE 24E8 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER Y 


LETTER Y 


46 iSeries: Globalization (Handle data in global applications) 


Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


24CF 24E9 CIRCLED LATIN CAPITAL CIRCLED LATIN SMALL LETTER Z 
LETTER Z 

FF21 FF41 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTERA A 

FF22 FF42 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER B B 

FF23 FF43 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER C Cc 

FF24 FF44 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER D D 

FF25 FF45 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER E E 

FF26 FF46 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER F F 

FF27 FF47 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER G G 

FF28 FF48 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER H H 

FF29 FF49 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER | I 

FF2A FF4A FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER J J 

FF2B FF4B FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER K K 

FF2C FF4C FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER L IL 

FF2D FF4D FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER M M 

FF2E FF4E FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER N N 

FF2F FF4F FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER O O 

FF30 FF50 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER P P 

FF31 FF51 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER Q Q 

FF32 FF52 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER R R 

FF33 FF53 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER S S) 

FF34 FF54 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER T T 

FF35 FF55 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER U U 

FF36 FF56 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 


LETTER V 


V 
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Uppercase code 
point 


Lowercase code 
point 


Uppercase character 
description 


Lowercase character description 


FF37 FF57 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER W W 

FF38 FF58 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER X X 

FF39 FF59 FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 
LETTER Y ‘4 

FF3A FF5A FULLWIDTH LATIN CAPITAL FULLWIDTH LATIN SMALL LETTER 


LETTER Z 


Z 


ISO 10646 lowercase to uppercase mapping table: 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0061 0041 LATIN SMALL LETTER A LATIN CAPITAL LETTER A 
0062 0042 LATIN SMALL LETTER B LATIN CAPITAL LETTER B 
0063 0043 LATIN SMALL LETTER C LATIN CAPITAL LETTER C 
0064 0044 LATIN SMALL LETTER D LATIN CAPITAL LETTER D 
0065 0045 LATIN SMALL LETTER E LATIN CAPITAL LETTER E 
0066 0046 LATIN SMALL LETTER F LATIN CAPITAL LETTER F 
0067 0047 LATIN SMALL LETTER G LATIN CAPITAL LETTER G 
0068 0048 LATIN SMALL LETTER H LATIN CAPITAL LETTER H 
0069 0049 LATIN SMALL LETTER | LATIN CAPITAL LETTER | 
006A 004A LATIN SMALL LETTER J LATIN CAPITAL LETTER J 
006B 004B LATIN SMALL LETTER K LATIN CAPITAL LETTER K 
006C 004C LATIN SMALL LETTER L LATIN CAPITAL LETTER L 
006D 004D LATIN SMALL LETTER M LATIN CAPITAL LETTER M 
OO6E 004E LATIN SMALL LETTER N LATIN CAPITAL LETTER N 
OO6F 004F LATIN SMALL LETTER O LATIN CAPITAL LETTER O 
0070 0050 LATIN SMALL LETTER P LATIN CAPITAL LETTER P 
0071 0051 LATIN SMALL LETTER Q LATIN CAPITAL LETTER Q 
0072 0052 LATIN SMALL LETTER R LATIN CAPITAL LETTER R 
0073 0053 LATIN SMALL LETTER S LATIN CAPITAL LETTER S 
0074 0054 LATIN SMALL LETTER T LATIN CAPITAL LETTER T 
0075 0055 LATIN SMALL LETTER U LATIN CAPITAL LETTER U 
0076 0056 LATIN SMALL LETTER V LATIN CAPITAL LETTER V 
0077 0057 LATIN SMALL LETTER W LATIN CAPITAL LETTER W 
0078 0058 LATIN SMALL LETTER X LATIN CAPITAL LETTER X 
0079 0059 LATIN SMALL LETTER Y LATIN CAPITAL LETTER Y 
007A 005A LATIN SMALL LETTER Z LATIN CAPITAL LETTER Z 
OOEO 00CO0 LATIN SMALL LETTER A GRAVE | LATIN CAPITAL LETTER A GRAVE 
00E1 00C1 LATIN SMALL LETTER A GRAVE |LATIN CAPITAL LETTER A ACUTE 


48 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


O0E2 00C2 LATIN SMALL LETTER A GRAVE _|LATIN CAPITAL LETTER A 
CIRCUMFLEX 
00E3 00C3 LATIN SMALL LETTER A GRAVE | LATIN CAPITAL LETTER A TILDE 
00E4 00C4 LATIN SMALL LETTER A GRAVE _|LATIN CAPITAL LETTER A 
DIAERESIS 

OOE5 00C5 LATIN SMALL LETTER A GRAVE {LATIN CAPITAL LETTER A RING 

OOE6 00C6 LATIN SMALL LETTER A GRAVE {LATIN CAPITAL LETTER AE 

00E7 00C7 LATIN SMALL LETTER A GRAVE _|LATIN CAPITAL LETTER C CEDILLA 

O0O0E8 00C8 LATIN SMALL LETTER A GRAVE _|LATIN CAPITAL LETTER E GRAVE 

OOE9 00Cc9 LATIN SMALL LETTER A GRAVE |LATIN CAPITAL LETTER E ACUTE 

OOEA O0O0CA LATIN SMALL LETTER E LATIN CAPITAL LETTER E 
CIRCUMFLEX CIRCUMFLEX 

0O0EB 00CB LATIN SMALL LETTER E LATIN CAPITAL LETTER E 
DIAERESIS DIAERESIS 

OOEC 00CC LATIN SMALL LETTER | GRAVE — | LATIN CAPITAL LETTER | GRAVE 

OOED 00CD LATIN SMALL LETTER | ACUTE | LATIN CAPITAL LETTER | ACUTE 

OOEE OOCE LATIN SMALL LETTER | LATIN CAPITAL LETTER | 
CIRCUMFLEX CIRCUMFLEX 

OOEF OOCF LATIN SMALL LETTER | LATIN CAPITAL LETTER | 
DIAERESIS DIAERESIS 

OOFO 00DO LATIN SMALL LETTER ETH LATIN CAPITAL LETTER ETH 

OOF 1 o0oD1 LATIN SMALL LETTER N TILDE LATIN CAPITAL LETTER N TILDE 

OOF2 00D2 LATIN SMALL LETTER O GRAVE | LATIN CAPITAL LETTER O GRAVE 

OOF3 00D3 LATIN SMALL LETTER O ACUTE _| LATIN CAPITAL LETTER O ACUTE 

OOF4 00D4 LATIN SMALL LETTER O LATIN CAPITAL LETTER O 
CIRCUMFLEX CIRCUMFLEX 

OOF5 00D5 LATIN SMALL LETTER O TILDE | LATIN CAPITAL LETTER O TILDE 

OOF6 00D6 LATIN SMALL LETTER O LATIN CAPITAL LETTER O 
DIAERESIS DIAERESIS 

OOF8 00D8 LATIN SMALL LETTER O SLASH _ | LATIN CAPITAL LETTER O SLASH 

OOF9 0O0D9 LATIN SMALL LETTER U GRAVE _|LATIN CAPITAL LETTER U GRAVE 

OOFA OODA LATIN SMALL LETTER U ACUTE _|LATIN CAPITAL LETTER U ACUTE 

OOFB 0O0DB LATIN SMALL LETTER U LATIN CAPITAL LETTER U 
CIRCUMFLEX CIRCUMFLEX 

OOFC 00DC LATIN SMALL LETTER U LATIN CAPITAL LETTER U 
DIAERESIS DIAERESIS 

OOFD 0oDD LATIN SMALL LETTER Y ACUTE _|LATIN CAPITAL LETTER Y ACUTE 

OOFE OODE LATIN SMALL LETTER THORN LATIN CAPITAL LETTER THORN 

OOFF 0178 LATIN SMALL LETTER Y LATIN CAPITAL LETTER Y WITH 
DIAERESIS DIAERESIS 

0101 0100 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
MACRON MACRON 

0103 0102 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 


BREVE 


BREVE 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0105 0104 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
OGONEK OGONEK 

0107 0106 LATIN SMALL LETTER C WITH LATIN CAPITAL LETTER C WITH 
ACUTE ACUTE 

0109 0108 LATIN SMALL LETTER C WITH LATIN CAPITAL LETTER C WITH 
CIRCUMFLEX CIRCUMFLEX 

010B 010A LATIN SMALL LETTER C WITH LATIN CAPITAL LETTER C WITH 
DOT ABOVE DOT ABOVE 

010D 010C LATIN SMALL LETTER C WITH LATIN CAPITAL LETTER C WITH 
CARON CARON 

010F 010E LATIN SMALL LETTER D WITH LATIN CAPITAL LETTER D WITH 
CARON CARON 

0111 0110 LATIN SMALL LETTER D WITH LATIN CAPITAL LETTER D WITH 
STROKE STROKE 

0113 0112 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
MACRON MACRON 

0115 0114 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
BREVE BREVE 

0117 0116 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
DOT ABOVE DOT ABOVE 

0119 0118 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
OGONEK OGONEK 

011B O11A LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
CARON CARON 

011D 011C LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
CIRCUMFLEX CIRCUMFLEX 

011F O11E LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
BREVE BREVE 

0121 0120 LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
DOT ABOVE DOT ABOVE 

0123 0122 LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
CEDILLA CEDILLA 

0125 0124 LATIN SMALL LETTER H WITH LATIN CAPITAL LETTER H WITH 
CIRCUMFLEX CIRCUMFLEX 

0127 0126 LATIN SMALL LETTER H WITH LATIN CAPITAL LETTER H WITH 
STROKE STROKE 

0129 0128 LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
TILDE TILDE 

012B 012A LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
MACRON MACRON 

012D 012C LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
BREVE BREVE 

012F 012E LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
OGONEK OGONEK 

0131 0049 LATIN SMALL LETTER DOTLESS | LATIN CAPITAL LETTER | 
| 

0133 0132 LATIN SMALL LIGATURE lJ LATIN CAPITAL LIGATURE lJ 


50 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0135 0134 LATIN SMALL LETTER J WITH ‘| LATIN CAPITAL LETTER J WITH 
CIRCUMFLEX CIRCUMFLEX 

0137 0136 LATIN SMALL LETTER K WITH — | LATIN CAPITAL LETTER K WITH 
CEDILLA CEDILLA 

013A 0139 LATIN SMALL LETTER L WITH — | LATIN CAPITAL LETTER L WITH 
ACUTE ACUTE 

013C 013B LATIN SMALL LETTER L WITH — | LATIN CAPITAL LETTER L WITH 
CEDILLA CEDILLA 

013E 013D LATIN SMALL LETTER L WITH _ | LATIN CAPITAL LETTER L WITH 
CARON CARON 

0140 013F LATIN SMALL LETTER L WITH _ | LATIN CAPITAL LETTER L WITH 
MIDDLE DOT MIDDLE DOT 

0142 0141 LATIN SMALL LETTER L WITH _ | LATIN CAPITAL LETTER L WITH 
STROKE STROKE 

0144 0143 LATIN SMALL LETTER N WITH — |LATIN CAPITAL LETTER N WITH 
ACUTE ACUTE 

0146 0145 LATIN SMALL LETTER N WITH {LATIN CAPITAL LETTER N WITH 
CEDILLA CEDILLA 

0148 0147 LATIN SMALL LETTER N WITH — {LATIN CAPITAL LETTER N WITH 
CARON CARON 

014B 014A LATIN SMALL LETTER ENG LATIN CAPITAL LETTER ENG 
(SAMI) (SAMI) 

014D 014C LATIN SMALL LETTER O WITH | LATIN CAPITAL LETTER O WITH 
MACRON MACRON 

014F 014E LATIN SMALL LETTER O WITH _ | LATIN CAPITAL LETTER O WITH 
BREVE BREVE 

0151 0150 LATIN SMALL LETTER O WITH | LATIN CAPITAL LETTER O WITH 
DOUBLE ACUTE DOUBLE ACUTE 

0153 0152 LATIN SMALL LIGATURE OE LATIN CAPITAL LIGATURE OE 

0155 0154 LATIN SMALL LETTER R WITH _ {LATIN CAPITAL LETTER R WITH 
ACUTE ACUTE 

0157 LATIN SMALL LETTER R WITH — {LATIN CAPITAL LETTER R WITH 
CEDILLA CEDILLA 

0159 LATIN SMALL LETTER R WITH — |LATIN CAPITAL LETTER R WITH 
CARON CARON 

015B LATIN SMALL LETTER S WITH __|LATIN CAPITAL LETTER S WITH 
ACUTE ACUTE 

015D LATIN SMALL LETTER S WITH __|LATIN CAPITAL LETTER S WITH 
CIRCUMFLEX CIRCUMFLEX 

O15F LATIN SMALL LETTER S WITH __|LATIN CAPITAL LETTER S WITH 
CEDILLA CEDILLA 

0161 LATIN SMALL LETTER S WITH __|LATIN CAPITAL LETTER S WITH 
CARON CARON 

0163 LATIN SMALL LETTER T WITH _ | LATIN CAPITAL LETTER T WITH 
CEDILLA CEDILLA 

0165 LATIN SMALL LETTER T WITH — | LATIN CAPITAL LETTER T WITH 


CARON 


CARON 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0167 0166 LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
STROKE STROKE 

0169 0168 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
TILDE TILDE 

016B 016A LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
MACRON MACRON 

016D 016C LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
BREVE BREVE 

016F O016E LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
RING ABOVE RING ABOVE 

0171 0170 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DOUBLE ACUTE DOUBLE ACUTE 

0173 0172 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
OGONEK OGONEK 

0175 0174 LATIN SMALL LETTER W WITH LATIN CAPITAL LETTER W WITH 
CIRCUMFLEX CIRCUMFLEX 

0177 0176 LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
CIRCUMFLEX CIRCUMFLEX 

017A 0179 LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
ACUTE ACUTE 

017C 017B LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
DOT ABOVE DOT ABOVE 

017E 017D LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
CARON CARON 

0183 0182 LATIN SMALL LETTER B WITH LATIN CAPITAL LETTER B WITH 
TOPBAR TOPBAR 

0185 0184 LATIN SMALL LETTER TONE SIX |LATIN CAPITAL LETTER TONE SIX 

0188 0187 LATIN SMALL LETTER C WITH LATIN CAPITAL LETTER C WITH 
HOOK HOOK 

018C 018B LATIN SMALL LETTER D WITH LATIN CAPITAL LETTER D WITH 
TOPBAR TOPBAR 

0192 0191 LATIN SMALL LETTER F WITH LATIN CAPITAL LETTER F WITH 
HOOK HOOK 

0199 0198 LATIN SMALL LETTER K WITH LATIN CAPITAL LETTER K WITH 
HOOK HOOK 

01A1 01A0 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HORN HORN 

01A3 01A2 LATIN SMALL LETTER Ol LATIN CAPITAL LETTER Ol 

01A5 01A4 LATIN SMALL LETTER P WITH LATIN CAPITAL LETTER P WITH 
HOOK HOOK 

01A8 01A7 LATIN SMALL LETTER TONE LATIN CAPITAL LETTER TONE 
TWO TWO 

01AD 01AC LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
HOOK HOOK 

01BO O1AF LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 


HORN 


HORN 


52 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


01B4 01B3 LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
HOOK HOOK 

01B6 01B5 LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
STROKE STROKE 

01B9 01B8 LATIN SMALL LETTER EZH LATIN CAPITAL LETTER EZH 
REVERSED REVERSED 

01BD 01BC LATIN SMALL LETTER TONE LATIN CAPITAL LETTER TONE 
FIVE FIVE 

01C6 0104 LATIN SMALL LETTER DZ WITH | LATIN CAPITAL LETTER DZ WITH 
CARON CARON 

01C9 01C7 LATIN SMALL LETTER LJ LATIN CAPITAL LETTER LJ 

01CC 01CA LATIN SMALL LETTER NJ LATIN CAPITAL LETTER NJ 

01CE 01CD LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
CARON CARON 

01D0 01CF LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
CARON CARON 

01D2 01D1 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
CARON CARON 

01D4 01D3 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
CARON CARON 

01D6 01D5 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DIAERESIS AND MACRON DIAERESIS AND MACRON 

01D8 01D7 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DIAERESIS AND ACUTE DIAERESIS AND ACUTE 

01DA 01D9 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DIAERESIS AND CARON DIAERESIS AND CARON 

01DC 01DB LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DIAERESIS AND GRAVE DIAERESIS AND GRAVE 

01DF 01DE LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
DIAERESIS AND MACRON DIAERESIS AND MACRON 

01E1 01E0 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
DOT ABOVE AND MACRON DOT ABOVE AND MACRON 

01E3 01E2 LATIN SMALL LIGATURE AE LATIN CAPITAL LIGATURE AE MTH 
WITH MACRON MACRON 

01E5 01E4 LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
STROKE STROKE 

01E7 LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
CARON CARON 

01E9 LATIN SMALL LETTER K WITH LATIN CAPITAL LETTER K WITH 
CARON CARON 

01EB LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
OGONEK OGONEK 

01ED LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
OGONEK AND MACRON OGONEK AND MACRON 

O1EF O1EE LATIN SMALL LETTER EZH WITH | LATIN CAPITAL LETTER EZH WITH 
CARON CARON 

01F3 O1F1 LATIN SMALL LETTER DZ LATIN CAPITAL LETTER DZ 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


01F5 01F4 LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
ACUTE ACUTE 

01FB O1FA LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
RING ABOVE AND ACUTE RING ABOVE AND ACUTE 

01FD 01FC LATIN SMALL LIGATURE AE LATIN CAPITAL LIGATURE AE WITH 
WITH ACUTE ACUTE 

O1FF O1FE LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
STROKE AND ACUTE STROKE AND ACUTE 

0201 0200 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
DOUBLE GRAVE DOUBLE GRAVE 

0203 0202 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
INVERTED BREVE INVERTED BREVE 

0205 0204 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
DOUBLE GRAVE DOUBLE GRAVE 

0207 0206 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
INVERTED BREVE INVERTED BREVE 

0209 0208 LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
DOUBLE GRAVE DOUBLE GRAVE 

020B 020A LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
INVERTED BREVE INVERTED BREVE 

020D 020C LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
DOUBLE GRAVE DOUBLE GRAVE 

020F 020E LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
INVERTED BREVE INVERTED BREVE 

0211 0210 LATIN SMALL LETTER R WITH LATIN CAPITAL LETTER R WITH 
DOUBLE GRAVE DOUBLE GRAVE 

0213 0212 LATIN SMALL LETTER R WITH LATIN CAPITAL LETTER R WITH 
INVERTED BREVE INVERTED BREVE 

0215 0214 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DOUBLE GRAVE DOUBLE GRAVE 

0217 0216 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
INVERTED BREVE INVERTED BREVE 

0253 0181 LATIN SMALL LETTER B WITH LATIN CAPITAL LETTER B WITH 
HOOK HOOK 

0254 0186 LATIN SMALL LETTER OPEN O _— {LATIN CAPITAL LETTER OPEN O 

0257 018A LATIN SMALL LETTER D WITH LATIN CAPITAL LETTER D WITH 
HOOK HOOK 

0258 018E LATIN SMALL LETTER LATIN CAPITAL LETTER 
REVERSED E REVERSED E 

0259 018F LATIN SMALL LETTER SCHWA LATIN CAPITAL LETTER SCHWA 

025B 0190 LATIN SMALL LETTER OPEN E LATIN CAPITAL LETTER OPEN E 

0260 0193 LATIN SMALL LETTER G WITH LATIN CAPITAL LETTER G WITH 
HOOK HOOK 

0263 0194 LATIN SMALL LETTER GAMMA LATIN CAPITAL LETTER GAMMA 

0268 0197 LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 


STROKE 


STROKE 


54 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0269 0196 LATIN SMALL LETTER IOTA LATIN CAPITAL LETTER IOTA 

026F 019C LATIN SMALL LETTER TURNED | LATIN CAPITAL LETTER TURNED 
M M 

0272 019D LATIN SMALL LETTER N WITH LATIN CAPITAL LETTER N WITH 
LEFT HOOK LEFT HOOK 

0275 019F LATIN SMALL LETTER BARRED _ | LATIN CAPITAL LETTER O WITH 
O MIDDLE TILDE 

0283 01A9 LATIN SMALL LETTER ESH LATIN CAPITAL LETTER ESH 

0288 01AE LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
RETROFLEX HOOK RETROFLEX HOOK 

028A 01B1 LATIN SMALL LETTER UPSILON | LATIN CAPITAL LETTER UPSILON 

028B 01B2 LATIN SMALL LETTER V WITH LATIN CAPITAL LETTER V WITH 
HOOK HOOK 

0292 01B7 LATIN SMALL LETTER EZH LATIN CAPITAL LETTER EZH 

03AC 0386 GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH TONOS WITH TONOS 

03AD 0388 GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH TONOS WITH TONOS 

O3AE 0389 GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH TONOS WITH TONOS 

O3AF 038A GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH TONOS WITH TONOS 

03B1 0391 GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 

03B2 0392 GREEK SMALL LETTER BETA GREEK CAPITAL LETTER BETA 

03B3 0393 GREEK SMALL LETTER GAMMA | GREEK CAPITAL LETTER GAMMA 

03B4 0394 GREEK SMALL LETTER DELTA — | GREEK CAPITAL LETTER DELTA 

03B5 0395 GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON 

03B6 0396 GREEK SMALL LETTER ZETA GREEK CAPITAL LETTER ZETA 

03B7 0397 GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 

03B8 0398 GREEK SMALL LETTER THETA =| GREEK CAPITAL LETTER THETA 

03B9 0399 GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 

03BA 039A GREEK SMALL LETTER KAPPA | GREEK CAPITAL LETTER KAPPA 

03BB 039B GREEK SMALL LETTER LAMDA_ | GREEK CAPITAL LETTER LAMDA 

03BC 039C GREEK SMALL LETTER MU GREEK CAPITAL LETTER MU 

03BD 039D GREEK SMALL LETTER NU GREEK CAPITAL LETTER NU 

O3BE 039E GREEK SMALL LETTER XI GREEK CAPITAL LETTER XI 

O3BF 039F GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON OMICRON 

03C0 03A0 GREEK SMALL LETTER PI GREEK CAPITAL LETTER PI 

03C1 03A1 GREEK SMALL LETTER RHO GREEK CAPITAL LETTER RHO 

03C3 03A3 GREEK SMALL LETTER SIGMA | GREEK CAPITAL LETTER SIGMA 

0304 03A4 GREEK SMALL LETTER TAU GREEK CAPITAL LETTER TAU 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


03C5 03A5 GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON 
03C6 03A6 GREEK SMALL LETTER PHI GREEK CAPITAL LETTER PHI 
03C7 03A7 GREEK SMALL LETTER CHI GREEK CAPITAL LETTER CHI 
03C8 03A8 GREEK SMALL LETTER PSI GREEK CAPITAL LETTER PSI 
03C9 03A9 GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
03CA O3AA GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH DIALYTIKA WITH DIALYTIKA 
03CB 03AB GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH DIALYTIKA WITH DIALYTIKA 
03CC 038C GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH TONOS OMICRON WITH TONOS 
03CD 038E GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH TONOS WITH TONOS 
03CE 038F GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH TONOS WITH TONOS 
03E3 03E2 COPTIC SMALL LETTER SHEI COPTIC CAPITAL LETTER SHEI 
03E5 03E4 COPTIC SMALL LETTER FEI COPTIC CAPITAL LETTER FEI 
03E7 03E6 COPTIC SMALL LETTER KHEI COPTIC CAPITAL LETTER KHEI 
03E9 03E8 COPTIC SMALL LETTER HORI COPTIC CAPITAL LETTER HORI 
03EB O3EA COPTIC SMALL LETTER GANGIA | COPTIC CAPITAL LETTER GANGIA 
03ED 03EC COPTIC SMALL LETTER SHIMA_ | COPTIC CAPITAL LETTER SHIMA 
O3EF O3EE COPTIC SMALL LETTER DEI COPTIC CAPITAL LETTER DEI 
0430 0410 CYRILLIC SMALL LETTER A CYRILLIC CAPITAL LETTER A 
0431 0411 CYRILLIC SMALL LETTER BE CYRILLIC CAPITAL LETTER BE 
0432 0412 CYRILLIC SMALL LETTER VE CYRILLIC CAPITAL LETTER VE 
0433 0413 CYRILLIC SMALL LETTER GHE =| CYRILLIC CAPITAL LETTER GHE 
0434 0414 CYRILLIC SMALL LETTER DE CYRILLIC CAPITAL LETTER DE 
0435 0415 CYRILLIC SMALL LETTER IE CYRILLIC CAPITAL LETTER IE 
0436 0416 CYRILLIC SMALL LETTER ZHE CYRILLIC CAPITAL LETTER ZHE 
0437 0417 CYRILLIC SMALL LETTER ZE CYRILLIC CAPITAL LETTER ZE 
0438 0418 CYRILLIC SMALL LETTER | CYRILLIC CAPITAL LETTER | 
0439 0419 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER SHORT 
SHORT | | 
043A 041A CYRILLIC SMALL LETTER KA CYRILLIC CAPITAL LETTER KA 
043B 041B CYRILLIC SMALL LETTER EL CYRILLIC CAPITAL LETTER EL 
043C 041C CYRILLIC SMALL LETTER EM CYRILLIC CAPITAL LETTER EM 
043D 041D CYRILLIC SMALL LETTER EN CYRILLIC CAPITAL LETTER EN 
043E O41E CYRILLIC SMALL LETTER O CYRILLIC CAPITAL LETTER O 
043F 041F CYRILLIC SMALL LETTER PE CYRILLIC CAPITAL LETTER PE 
0440 0420 CYRILLIC SMALL LETTER ER CYRILLIC CAPITAL LETTER ER 
0441 0421 CYRILLIC SMALL LETTER ES CYRILLIC CAPITAL LETTER ES 


56 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0442 0422 CYRILLIC SMALL LETTER TE CYRILLIC CAPITAL LETTER TE 

0443 0423 CYRILLIC SMALL LETTER U CYRILLIC CAPITAL LETTER U 

0444 0424 CYRILLIC SMALL LETTER EF CYRILLIC CAPITAL LETTER EF 

0445 0425 CYRILLIC SMALLLETTER HA | CYRILLIC CAPITAL LETTER HA 

0446 0426 CYRILLIC SMALL LETTER TSE | CYRILLIC CAPITAL LETTER TSE 

0447 0427 CYRILLIC SMALL LETTER CHE | CYRILLIC CAPITAL LETTER CHE 

0448 0428 CYRILLIC SMALL LETTER SHA | CYRILLIC CAPITAL LETTER SHA 

0449 0429 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER SHCHA 
SHCHA 

044A 042A CYRILLIC SMALL LETTER HARD | CYRILLIC CAPITAL LETTER HARD 
SIGN SIGN 

044B 042B CYRILLIC SMALL LETTER YERU | CYRILLIC CAPITAL LETTER YERU 

044C 042C CYRILLIC SMALL LETTER SOFT | CYRILLIC CAPITAL LETTER SOFT 
SIGN SIGN 

044D 042D CYRILLIC SMALL LETTER E CYRILLIC CAPITAL LETTER E 

044E 042E CYRILLIC SMALL LETTER YU CYRILLIC CAPITAL LETTER YU 

044F 042F CYRILLIC SMALL LETTER YA CYRILLIC CAPITAL LETTER YA 

0451 0401 CYRILLIC SMALL LETTER IO CYRILLIC CAPITAL LETTER IO 

0452 0402 CYRILLIC SMALL LETTER DJE_ | CYRILLIC CAPITAL LETTER DJE 
(SERBOCROATIAN) (SERBOCROATIAN) 

0453 0403 CYRILLIC SMALL LETTER GJE_ | CYRILLIC CAPITAL LETTER GJE 

0454 0404 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
UKRAINIAN IE UKRAINIAN IE 

0455 0405 CYRILLIC SMALL LETTER DZE | CYRILLIC CAPITAL LETTER DZE 

0456 0406 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
BYELORUSSIAN-UKRAINIAN | | BYELORUSSIAN_UKRAINIAN I 

0457 0407 CYRILLIC SMALL LETTER YI CYRILLIC CAPITAL LETTER YI 
(UKRANIAN) (UKRANIAN) 

0458 0408 CYRILLIC SMALL LETTER JE CYRILLIC CAPITAL LETTER JE 

0459 0409 CYRILLIC SMALL LETTER LJE ‘| CYRILLIC CAPITAL LETTER LJE 

045A 040A CYRILLIC SMALL LETTER NJE_ | CYRILLIC CAPITAL LETTER NJE 

045B 040B CYRILLIC SMALL LETTER TSHE | CYRILLIC CAPITAL LETTER TSHE 
(SERBOCROATIAN) (SERBOCROATIAN) 

045C 040 CYRILLIC SMALL LETTER KJE | CYRILLIC CAPITAL LETTER KJE 

045E 040E CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER SHORT 
SHORT U (BYELORUSSIAN) U (BYELORUSSIAN) 

045F 040F CYRILLIC SMALL LETTER DZHE | CYRILLIC CAPITAL LETTER DZHE 

0461 0460 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
OMEGA OMEGA 

0463 0462 CYRILLIC SMALL LETTER YAT _| CYRILLIC CAPITAL LETTER YAT 

0465 0464 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
lOTIFIED E lOTIFIED E 

0467 0466 CYRILLIC SMALL LETTER LITTLE | CYRILLIC CAPITAL LETTER LITTLE 


YUS 


YUS 


OS/400 globalization 57 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0469 0468 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
IOTFIED LITTLE YUS IOTIFIED LITTLE YUS 

046B 046A CYRILLIC SMALL LETTER BIG CYRILLIC CAPITAL LETTER BIG 
YUS YUS 

046D 046C CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
IOTIFIED BIG YUS IOTIFIED BIG YUS 

046F 046E CYRILLIC SMALL LETTER KSI CYRILLIC CAPITAL LETTER KSI 

0471 0470 CYRILLIC SMALL LETTER PSI CYRILLIC CAPITAL LETTER PSI 

0473 0472 CYRILLIC SMALL LETTER FITA =| CYRILLIC CAPITAL LETTER FITA 

0475 0474 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
IZHITSA IZHITSA 

0477 0476 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
IZHITSA WITH DOUBLE GRAVE _ | IZHITSA WITH DOUBLE GRAVE 
ACCENT ACCENT 

0479 0478 CYRILLIC SMALL LETTER UK CYRILLIC CAPITAL LETTER UK 

047B 047A CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
ROUND OMEGA ROUND OMEGA 

047D 047C CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
OMEGA WITH TITLO OMEGA WITH TITLO 

047F 047E CYRILLIC SMALL LETTER OT CYRILLIC CAPITAL LETTER OT 

0481 0480 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER KOPPA 
KOPPA 

0491 0490 CYRILLIC SMALL LETTER GHE =| CYRILLIC CAPITAL LETTER GHE 
WITH UPTURN WITH UPTURN 

0493 0492 CYRILLIC SMALL LETTER GHE =| CYRILLIC CAPITAL LETTER GHE 
WITH STROKE WITH STROKE 

0495 0494 CYRILLIC SMALL LETTER GHE | CYRILLIC CAPITAL LETTER GHE 
WITH MIDDLE HOOK WITH MIDDLE HOOK 

0497 0496 CYRILLIC SMALL LETTER ZHE CYRILLIC CAPITAL LETTER ZHE 
WITH DESCENDER WITH DESCENDER 

0499 0498 CYRILLIC SMALL LETTER ZE CYRILLIC CAPITAL LETTER ZE 
WITH DESCENDER WITH DESCENDER 

049B 049A CYRILLIC SMALL LETTER KA CYRILLIC CAPITAL LETTER KA 
WITH DESCENDER WITH DESCENDER 

049D 049C CYRILLIC SMALL LETTER KA CYRILLIC CAPITAL LETTER KA 
WITH VERTICAL STROKE WITH VERTICAL STROKE 

049F 049E CYRILLIC SMALL LETTER KA CYRILLIC CAPITAL LETTER KA 
WITH STROKE WITH STROKE 

04A1 04A0 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
EASHKIR KA BASHKIR KA 

04A3 04A2 CYRILLIC SMALL LETTER EN CYRILLIC CAPITAL LETTER EN 
WITH DESCENOER WITH DESCENDER 

04A5 04A4 CYRILLIC SMALL LIGATURE EN | CYRILLIC CAPITAL LIGATURE EN 
GHE GHF 

04A7 04A6 CYRILLIC SMALL LETTER PE CYRILLIC CAPITAL LETTER PE 


WITH MIDDLE HOOK 
(ABKHASIAN) 


WITH MIDDLE HOOK (ABKHASIAN) 


58 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


04A9 04A8 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
ABKHASIAN HA ABKHASIAN HA 

04AB 04AA CYRILLIC SMALL LETTER ES CYRILLIC CAPITAL LETTER ES 
WITH DESCENDER WITH DESCENDER 

04AD 04AC CYRILLIC SMALL LETTER TE CYRILLIC CAPITAL LETTER TE 
WITH DESCENDER WITH DESCENDER 

O4AF 04AE CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
STRAIGHT U STRAIGHT U 

04B1 04B0 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
STRAIGHT U WITH STROKE STRAIGHT U WITH STROKE 

04B3 04B2 CYRILLIC SMALL LETTER HA CYRILLIC CAPITAL LETTER HA 
WITH DESCENDER WITH DESCENDER 

04B5 04B4 CYRILLIC SMALL LIGATURE TE | CYRILLIC CAPITAL LIGATURE TE 
TSE (ABKHASIAN) TSE (ABKHASIAN) 

04B7 04B6 CYRILLIC SMALL LETTER CHE =| CYRILLIC CAPITAL LETTER CHE 
WITH DESCENDER WITH DESCENDER 

04B9 04B8 CYRILLIC SMALL LETTER CHE | CYRILLIC CAPITAL LETTER CHE 
WITH VERTICAL STROKE WITH VERTICAL STROKE 

04BB 04BA CYRILLIC SMALL LETTER SHHA | CYRILLIC CAPITAL LETTER SHHA 

04BD 04BC CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
ABKHASIAN CHE ABKHASIAN CHE 

04BF 04BE CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
ABKHASIAN CHE WITH ABKHASIAN CHE WITH 
DESCENDER DESCENDER 

04C2 04C1 CYRILLIC SMALL LETTER ZHE CYRILLIC CAPITAL LETTER ZHE 
WITH BREVE WITH BREVE 

0404 CYRILLIC SMALL LETTER KA CYRILLIC CAPITAL LETTER KA 
WITH HOOK WITH HOOK 

04C8 CYRILLIC SMALL LETTER EN CYRILLIC CAPITAL LETTER EN 
WITH HOOK WITH HOOK 

04CC CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
KHAKASSIAN CHE KHAKASSIAN CHE 

04D1 CYRILLIC SMALL LETTER A CYRILLIC CAPITAL LETTER A WITH 
WITH BREVE BREVE 

04D3 04D2 CYRILLIC SMALL LETTER A CYRILLIC CAPITAL LETTER A WITH 
WITH DIAERESIS DIAERESIS 

04D5 04D4 CYRILLIC SMALL LIGATURE AIE | CYRILLIC CAPITAL LIGATURE A IE 

04D7 04D6 CYRILLIC SMALL LETTER IE CYRILLIC CAPITAL LETTER IE 
WITH BREVE WITH BREVE 

04D9 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
SCHWA SCHWA 

04DB CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
SCHWA WITH DIAERESIS SCHWA WITH DIAERESIS 

04DD CYRILLIC SMALL LETTER ZHE CYRILLIC CAPITAL LETTER ZHE 
WITH DIAERESIS WITH DIAERESIS 

04DF CYRILLIC SMALL LETTER ZE CYRILLIC CAPITAL LETTER ZE 


WITH DIAERESIS 


WITH DIAERESIS 


OS/400 globalization 59 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


04E1 04E0 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
ABKHASIAN DZE ABKHASIAN DZE 

04E3 04E2 CYRILLIC SMALL LETTER | WITH | CYRILLIC CAPITAL LETTER | WITH 
MACRON MACRON 

04E5 04E4 CYRILLIC SMALL LETTER | WITH | CYRILLIC CAPITAL LETTER | WITH 
DIAERESIS DIAERESIS 

04E7 04E6 CYRILLIC SMALL LETTER O CYRILLIC CAPITAL LETTER O 
WITH DIAERESIS WITH DIAERESIS 

04E9 04E8 CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
BARRED O BARRED O 

04EB O4EA CYRILLIC SMALL LETTER CYRILLIC CAPITAL LETTER 
BARRED O WITH DIAERESIS —_| BARRED O WITH DIAERESS 

O4EF O4EE CYRILLIC SMALL LETTER U CYRILLIC CAPITAL LETTER U 
WITH MACRON WITH MACRON 

O4F 1 04FO CYRILLIC SMALL LETTER U CYRILLIC CAPITAL LETTER U 
WITH DIAERESIS WITH DIAERESIS 

04F3 04F2 CYRILLIC SMALL LETTER U CYRILLIC CAPITAL LETTER U 
WITH DOUBLE ACUTE WITH DOUBLE ACUTE 

O4F5 04F4 CYRILLIC SMALL LETTER CHE | CYRILLIC CAPITAL LETTER CHE 
AITH DIAERESIS WITH DIAERESIS 

04F9 04F8 CYRILLIC SMALL LETTER YERU | CYRILLIC CAPITAL LETTER YERU 
WITH DIAERESIS WITH DIAERESIS 

0561 0531 ARMENIAN SMALL LETTER AYB_ | ARMENIAN CAPITAL LETTER AYB 

0562 0532 ARMENIAN SMALL LETTER BEN | ARMENIAN CAPITAL LETTER BEN 

0563 0533 ARMENIAN SMALL LETTER GIM | ARMENIAN CAPITAL LETTER GIM 

0564 0534 ARMENIAN SMALL LETTER DA | ARMENIAN CAPITAL LETTER DA 

0565 0535 ARMENIAN SMALL LETTER ECH | ARMENIAN CAPITAL LETTER ECH 

0566 0536 ARMENIAN SMALL LETTER ZA | ARMENIAN CAPITAL LETTER ZA 

0567 0537 ARMENIAN SMALL LETTER EH_ | ARMENIAN CAPITAL LETTER EH 

0568 0538 ARMENIAN SMALL LETTER ET | ARMENIAN CAPITAL LETTER ET 

0569 0539 ARMENIAN SMALL LETTER TO | ARMENIAN CAPITAL LETTER TO 

056A 053A ARMENIAN SMALL LETTER ZHE | ARMENIAN CAPITAL LETTER ZHE 

056B 053B ARMENIAN SMALL LETTER INI | ARMENIAN CAPITAL LETTER INI 

056C 053C ARMENIAN SMALL LETTER LIWN | ARMENIAN CAPITAL LETTER LIWN 

056D 053D ARMENIAN SMALL LETTER XEH | ARMENIAN CAPITAL LETTER XEH 

056E 053E ARMENIAN SMALL LETTER CA | ARMENIAN CAPITAL LETTER CA 

056F 053F ARMENIAN SMALL LETTER KEN | ARMENIAN CAPITAL LETTER KEN 

0570 0540 ARMENIAN SMALL LETTER HO | ARMENIAN CAPITAL LETTER HO 

0571 0541 ARMENIAN SMALL LETTER JA | ARMENIAN CAPITAL LETTER JA 

0572 0542 ARMENIAN SMALL LETTER ARMENIAN CAPITAL LETTER 
GHAD GHAD 

0573 0543 ARMENIAN SMALL LETTER ARMENIAN CAPITAL LETTER 
CHEH CHEH 

0574 0544 ARMENIAN SMALL LETTER MEN | ARMENIAN CAPITAL LETTER MEN 


60 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


0575 0545 ARMENIAN SMALL LETTER YI | ARMENIAN CAPITAL LETTER YI 
0576 0546 ARMENIAN SMALL LETTER NOW | ARMENIAN CAPITAL LETTER NOW 
0577 0547 ARMENIAN SMALL LETTER SNA | ARMENIAN CAPITAL LETTER SHA 
0578 0548 ARMENIAN SMALL LETTER VO | ARMENIAN CAPITAL LETTER VO 
0579 0549 ARMENIAN SMALL LETTER CHA | ARMENIAN CAPITAL LETTER CHA 
057A 054A ARMENIAN SMALL LETTER PEH | ARMENIAN CAPITAL LETTER PEH 
057B 054B ARMENIAN SMALL LETTER ARMENIAN CAPITAL LETTER JHEH 
JHEH 
057C 054C ARMENIAN SMALL LETTER RA | ARMENIAN CAPITAL LETTER RA 
057D 054D ARMENIAN SMALL LETTER SEH | ARMENIAN CAPITAL LETTER SEH 
057E 054E ARMENIAN SMALL LETTER VEW | ARMENIAN CAPITAL LETTER VEW 
057F 054F ARMENIAN SMALL LETTER TIWN | ARMENIAN CAPITAL LETTER TIWN 
0580 0550 ARMENIAN SMALL LETTER REH | ARMENIAN CAPITAL LETTER REH 
0581 0551 ARMENIAN SMALL LETTER CO | ARMENIAN CAPITAL LETTER CO 
0582 0552 ARMENIAN SMALL LETTER ARMENIAN CAPITAL LETTER YIWN 
YIWN 
0583 0553 ARMENIAN SMALL LETTER ARMENIAN CAPITAL LETTER PIWR 
PIWP 
0584 0554 ARMENIAN SMALL LETTER KEH | ARMENIAN CAPITAL LETTER KEH 
0585 0555 ARMENIAN SMALL LETTER OH_ | ARMENIAN CAPITAL LETTER OH 
0586 0556 ARMENIAN SMALL LETTER FEH | ARMENIAN CAPITAL LETTER FEH 
10D0 10A0 GEORGIAN LETTER AN GEORGIAN CAPITAL LETTER AN 
(KHUTSURI) 
10D1 10A1 GEORGIAN LETTER BAN GEORGIAN CAPITAL LETTER BAN 
(KHUTSURI) 
10D2 10A2 GEORGIAN LETTER GAN GEORGIAN CAPITAL LETTER GAN 
(KHUTSURI) 
10D3 10A3 GEORGIAN LETTER DON GEORGIAN CAPITAL LETTER DON 
(KHUTSURI) 
10D4 10A4 GEORGIAN LETTER EN GEORGIAN CAPITAL LETTER EN 
(KHUTSURI) 
10D5 10A5 GEORGIAN LETTER VIN GEORGIAN CAPITAL LETTER VIN 
(KHUTSURI) 
10D6 10A6 GEORGIAN LETTER ZEN GEORGIAN CAPITAL LETTER ZEN 
(KHUTSURI) 
10D7 10A7 GEORGIAN LETTER TAN GEORGIAN CAPITAL LETTER TAN 
(KHUTSURI) 
10D8 10A8 GEORGIAN LETTER IN GEORGIAN CAPITAL LETTER IN 
(KHUTSURI) 
10D9 10A9 GEORGIAN LETTER KAN GEORGIAN CAPITAL LETTER KAN 
(KHUTSURI) 
10DA 10AA GEORGIAN LETTER LAS GEORGIAN CAPITAL LETTER LAS 
(KHUTSURI) 
10DB 10AB GEORGIAN LETTER MAN GEORGIAN CAPITAL LETTER MAN 


(KHUTSURI) 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


10DC 10AC GEORGIAN LETTER NAR GEORGIAN CAPITAL LETTER NAR 
(KHUTSURI) 

10DD 10AD GEORGIAN LETTER ON GEORGIAN CAPITAL LETTER ON 
(KHUTSURI) 

10DE 10AE GEORGIAN LETTER PAR GEORGIAN CAPITAL LETTER PAR 
(KHUTSURI) 

10DF 10AF GEORGIAN LETTER ZHAR GEORGIAN CAPITAL LETTER 
ZHAR (KHUTSURI) 

10E0 10B0 GEORGIAN LETTER RAE GEORGIAN CAPITAL LETTER RAE 
(KHUTSURI) 

10E1 10B1 GEORGIAN LETTER SAN GEORGIAN CAPITAL LETTER SAN 
(KHUTSURI) 

10E2 10B2 GEORGIAN LETTER TAR GEORGIAN CAPITAL LETTER TAR 
(KHUTSURI) 

10E3 10B3 GEORGIAN LETTER UN GEORGIAN CAPITAL LETTER UN 
(KHUTSURI) 

10E4 10B4 GEORGIAN LETTER PHAR GEORGIAN CAPITAL LETTER 
PHAR (KHUTSURI) 

10E5 10B5 GEORGIAN LETTER KHAR GEORGIAN CAPITAL LETTER 
KHAR (KHUTSURI) 

10E6 10B6 GEORGIAN LETTER GHAN GEORGIAN CAPITAL LETTER 
GHAN (KHUTSURI) 

10E7 10B7 GEORGIAN LETTER QAR GEORGIAN CAPITAL LETTER QAR 
(KHUTSURI) 

10E8 10B8 GEORGIAN LETTER SHIN GEORGIAN CAPITAL LETTER SHIN 
(KHUTSURI) 

10E9 10B9 GEORGIAN LETTER CHIN GEORGIAN CAPITAL LETTER CHIN 
(KHUTSURI) 

10EA 10BA GEORGIAN LETTER CAN GEORGIAN CAPITAL LETTER CAN 
(KHUTSURI) 

10EB 10BB GEORGIAN LETTER JIL GEORGIAN CAPITAL LETTER JIL 
(KHUTSURI) 

10EC 10BC GEORGIAN LETTER CIL GEORGIAN CAPITAL LETTER CIL 
(KHUTSURI) 

10ED 10BD GEORGIAN LETTER CHAR GEORGIAN CAPITAL LETTER 
CHAR (KHUTSURI) 

10EE 10BE GEORGIAN LETTER XAN GEORGIAN CAPITAL LETTER XAN 
(KHUTSURI) 

10EF 10BF GEORGIAN LETTER JHAN GEORGIAN CAPITAL LETTER JHAN 
(KHUTSURI) 

10FO 10C0 GEORGIAN LETTER HAE GEORGIAN CAPITAL LETTER HAE 
(KHUTSURI) 

10F1 10C1 GEORGIAN LETTER HE GEORGIAN CAPITAL LETTER HE 
(KHUTSURI) 

10F2 10C2 GEORGIAN LETTER HIE GEORGIAN CAPITAL LETTER HIE 


(KHUTSURI) 


62 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


10F3 10C3 GEORGIAN LETTER WE GEORGIAN CAPITAL LETTER WE 
(KHUTSURI) 
10F4 10C4 GEORGIAN LETTER HAR GEORGIAN CAPITAL LETTER HAR 
(KHUTSURI) 
10F5 10C5 GEORGIAN LETTER HOE GEORGIAN CAPITAL LETTER HOE 
(KHUTSURI) 
1E01 1E00 LATIN SMALL LETTER A WITH | LATIN CAPITAL LETTER A WITH 
RING BELOW RING BELOW 
1E03 1E02 LATIN SMALL LETTER B WITH __|LATIN CAPITAL LETTER B WITH 
DOT ABOVE DOT ABOVE 
1E05 1E04 LATIN SMALL LETTER B WITH __|LATIN CAPITAL LETTER B WITH 
DOT BELOW DOT BELOW 
1E07 1E06 LATIN SMALL LETTER B WITH — |LATIN CAPITAL LETTER B WITH 
LINE BELOW LINE BELOW 
1E09 1E08 LATIN SMALL LETTER C WITH _ | LATIN CAPITAL LETTER C WITH 
CEDILLA AND ACUTE CEDILLA AND ACUTE 
1E0B 1EOA LATIN SMALL LETTER D WITH | LATIN CAPITAL LETTER D WITH 
DOT ABOVE DOT ABOVE 
1EOD 1E0C LATIN SMALL LETTER D WITH | LATIN CAPITAL LETTER D WITH 
DOT BELOW DOT BELOW 
1EOF 1E0E LATIN SMALL LETTER D WITH | LATIN CAPITAL LETTER D WITH 
LINE BELOW LINE BELOW 
1E11 1E10 LATIN SMALL LETTER D WITH _ | LATIN CAPITAL LETTER D WITH 
CEDILLA CEDILLA 
1E13 1E12 LATIN SMALL LETTER D WITH _ | LATIN CAPITAL LETTER D WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 
1E15 LATIN SMALL LETTER E WITH _ | LATIN CAPITAL LETTER E WITH 
MACRON AND GRAVE MACRON AND GRAVE 
1E17 LATIN SMALL LETTER E WITH _ | LATIN CAPITAL LETTER E WITH 
MACRON AND ACUTE MACRON AND ACUTE 
1E19 LATIN SMALL LETTER E WITH _ | LATIN CAPITAL LETTER E WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 
1E1B LATIN SMALL LETTER E WITH _ | LATIN CAPITAL LETTER E WITH 
TILDE BELOW TILDE BELOW 
1E1D 1E1C LATIN SMALL LETTER E WITH _ | LATIN CAPITAL LETTER E WITH 
CEDILLA AND BREVE CEDILLA AND BREVE 
1E1F 1E1E LATIN SMALL LETTER F WITH | LATIN CAPITAL LETTER F WITH 
DOT ABOVE DOT ABOVE 
1E21 1E20 LATIN SMALL LETTER G WITH __| LATIN CAPITAL LETTER G WITH 
MACRON MACRON 
1E23 1E22 LATIN SMALL LETTER H WITH _ | LATIN CAPITAL LETTER H WITH 
DOT ABOVE DOT ABOVE 
1E25 1E24 LATIN SMALL LETTER H WITH _ | LATIN CAPITAL LETTER H WITH 
DOT BELOW DOT BELOW 
1E27 1E26 LATIN SMALL LETTER H WITH _ | LATIN CAPITAL LETTER H WITH 


DIAERESIS 


DIAERESIS 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1E29 1E28 LATIN SMALL LETTER H WITH LATIN CAPITAL LETTER H WITH 
CEDILLA CEDILLA 

1E2B 1E2A LATIN SMALL LETTER H WITH LATIN CAPITAL LETTER H WITH 
BREVE BELOW BREVE BELOW 

1E2D 1E2C LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
TILDE BELOW TILDE BELOW 

1E2F 1E2E LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
DIAERESIS AND ACUTE DIAERESIS AND ACUTE 

1E31 1E30 LATIN SMALL LETTER K WITH LATIN CAPITAL LETTER K WITH 
ACUTE ACUTE 

1E33 1E32 LATIN SMALL LETTER K WITH LATIN CAPITAL LETTER K WITH 
DOT BELOW DOT BELOW 

1E35 1E34 LATIN SMALL LETTER K WITH LATIN CAPITAL LETTER K WITH 
LINE BELOW LINE BELOW 

1E37 1E36 LATIN SMALL LETTER L WITH LATIN CAPITAL LETTER L WITH 
DOT BELOW DOT BELOW 

1E39 1E38 LATIN SMALL LETTER L WITH LATIN CAPITAL LETTER L WITH 
DOT BELOW AND MACRON DOT BELOW AND MACRON 

1E3B 1E3A LATIN SMALL LETTER L WITH LATIN CAPITAL LETTER L WITH 
LINE BELOW LINE BELOW 

1E3D 1E3C LATIN SMALL LETTER L WITH LATIN CAPITAL LETTER L WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E3F 1E3E LATIN SMALL LETTER M WITH LATIN CAPITAL LETTER M WITH 
ACUTE ACUTE 

1E41 1E40 LATIN SMALL LETTER M WITH LATIN CAPITAL LETTER M WITH 
DOT ABOVE DOT ABOVE 

1E43 1E42 LATIN SMALL LETTER M WITH LATIN CAPITAL LETTER M WITH 
DOT BELOW DOT BELOW 

1E45 1E44 LATIN SMALL LETTER N WITH LATIN CAPITAL LETTER N WITH 
DOT ABOVE DOT ABOVE 

1E47 1E46 LATIN SMALL LETTER N WITH LATIN CAPITAL LETTER N WITH 
DOT BELOW DOT BELOW 

1E49 1E48 LATIN SMALL LETTER N WITH LATIN CAPITAL LETTER N WITH 
LINE BELOW LINE BELOW 

1E4B 1E4A LATIN SMALL LETTER N WITH LATIN CAPITAL LETTER N WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E4D 1E4C LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
TILDE AND ACUTE TILDE AND ACUTE 

1E4F 1E4E LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
TILDE AND DIAERESIS TILDE AND DIAERESIS 

1E51 1E50 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
MACRON AND GRAVE MACRON AND GRAVE 

1E53 1E52 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
MACRON AND ACUTE MACRON AND ACUTE 

1E55 1E54 LATIN SMALL LETTER P WITH LATIN CAPITAL LETTER P WITH 


ACUTE 


ACUTE 


64 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1E57 1E56 LATIN SMALL LETTER P WITH LATIN CAPITAL LETTER P WITH 
DOT ABOVE DOT ABOVE 

1E59 1E58 LATIN SMALL LETTER R WITH LATIN CAPITAL LETTER R WITH 
DOT ABOVE DOT ABOVE 

1E5B 1E5A LATIN SMALL LETTER R WITH LATIN CAPITAL LETTER R WITH 
DOT BELOW DOT BELOW 

1E5D 1E5C LATIN SMALL LETTER R WITH LATIN CAPITAL LETTER R WITH 
DOT BELOW AND MACRON DOT BELOW AND MACRON 

1E5F 1E5E LATIN SMALL LETTER R WITH LATIN CAPITAL LETTER R WITH 
LINE BELOW LINE BELOW 

1E61 1E60 LATIN SMALL LETTER S WITH LATIN CAPITAL LETTER S WITH 
DOT ABOVE DOT ABOVE 

1E63 1E62 LATIN SMALL LETTER S WITH LATIN CAPITAL LETTER S WITH 
DOT BELOW DOT BELOW 

1E65 1E64 LATIN SMALL LETTER S WITH LATIN CAPITAL LETTER S WITH 
ACUTE AND DOT ABOVE ACUTE AND DOT ABOVE 

1E67 1E66 LATIN SMALL LETTER S WITH LATIN CAPITAL LETTER S WITH 
CARON AND DOT ABOVE CARON AND DOT ABOVE 

1E69 1E68 LATIN SMALL LETTER S WITH LATIN CAPITAL LETTER S WITH 
DOT BELOW AND DOT ABOVE DOT BELOW AND DOT ABOVE 

1E6B 1E6A LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
DOT ABOVE DOT ABOVE 

1E6D 1E6C LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
DOT BELOW DOT BELOW 

1E6F 1E6E LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
LINE BELOW LINE BELOW 

1E71 LATIN SMALL LETTER T WITH LATIN CAPITAL LETTER T WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E73 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DIAERESIS BELOW DIAERESIS BELOW 

1E75 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
TILDE BELOW TILDE BELOW 

1E77 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
CIRCUMFLEX BELOW CIRCUMFLEX BELOW 

1E79 1E78 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
TILDE AND ACUTE TILDE AND ACUTE 

1E7B 1E7A LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
MACRON AND DIAERESIS MACRON AND DIAERESIS 

1E7D 1E7C LATIN SMALL LETTER V WITH LATIN CAPITAL LETTER V WITH 
TILDE TILDE 

1E7F 1E7E LATIN SMALL LETTER V WITH LATIN CAPITAL LETTER V WITH 
DOT BELOW DOT BELOW 

1E81 1E80 LATIN SMALL LETTER W WITH LATIN CAPITAL LETTER W WITH 
GRAVE GRAVE 

1E83 1E82 LATIN SMALL LETTER W WITH LATIN CAPITAL LETTER W WITH 


ACUTE 


ACUTE 
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65 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1E85 1E84 LATIN SMALL LETTER W WITH LATIN CAPITAL LETTER W WITH 
DIAERESIS DIAERESIS 

1E87 1E86 LATIN SMALL LETTER W WITH LATIN CAPITAL LETTER W WITH 
DOT ABOVE DOT ABOVE 

1E89 1E88 LATIN SMALL LETTER W WITH LATIN CAPITAL LETTER W WITH 
DOT BELOW DOT BELOW 

1E8B 1E8A LATIN SMALL LETTER X WITH LATIN CAPITAL LETTER X WITH 
DOT ABOVE DOT ABOVE 

1E8D 1E8C LATIN SMALL LETTER X WITH LATIN CAPITAL LETTER X5 WITH 
DIAERESIS DIAERESIS 

1E8F 1E8E LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
DOT ABOVE DOT ABOVE 

1E91 1E90 LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
CIRCUMFLEX CIRCUMFLEX 

1E93 1E92 LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
DOT BELOW DOT BELOW 

1E95 1E94 LATIN SMALL LETTER Z WITH LATIN CAPITAL LETTER Z WITH 
LINE BELOW LINE BELOW 

1EA1 1EA0O LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
DOT BELOW DOT BELOW 

1EA3 1EA2 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
HOOK ABOVE HOOK ABOVE 

1EA5 1EA4 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
CIRCUMFLEX AND ACUTE CIRCUMFLEX AND ACUTE 

1EA7 1EA6 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
CIRCUMFLEX AND GRAVE CIRCUMFLEX AND GRAVE 

1EA9 1EA8 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
CIRCUMFLEX AND HOOK ABOVE | CIRCUMFLEX AND HOOK ABOVE 

1EAB 1EAA LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
CIRCUMFLEX AND TILDE CIRCUMFLEX AND TILDE 

1EAD 1EAC LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
CIRCUMFLEX AND DOT BELOW | CIRCUMFLEX AND DOT BELOW 

1EAF 1EAE LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
BREVE AND ACUTE BREVE AND ACUTE 

1EB1 1EBO LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
BREVE AND GRAVE BREVE AND GRAVE 

1EB3 1EB2 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
BREVE AND HOOK ABOVE BREVE AND HOOK ABOVE 

1EB5 1EB4 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
BREVE AND TILDE BREVE AND TILDE 

1EB7 1EB6 LATIN SMALL LETTER A WITH LATIN CAPITAL LETTER A WITH 
BREVE AND DOT BELOW BREVE AND DOT BELOW 

1EB9 1EB8 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
DOT BELOW DOT BELOW 

1EBB 1EBA LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 


HOOK ABOVE 


HOOK ABOVE 


66 iSeries: Globalization (Handle data in global applications) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1EBD 1EBC LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
TILDE TILDE 

1EBF 1EBE LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
CIRCUMFLEX AND ACUTE CIRCUMFLEX AND ACUTE 

1EC1 1ECO LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
CIRCUMFLEX AND GRAVE CIRCUMFLEX AND GRAVE 

1EC3 1EC2 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
CIRCUMFLEX AND HOOK ABOVE | CIRCUMFLEX AND HOOK ABOVE 

1EC5 1EC4 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
CIRCUMFLEX AND TILDE CIRCUMFLEX AND TILDE 

1EC7 1EC6 LATIN SMALL LETTER E WITH LATIN CAPITAL LETTER E WITH 
CIRCUMFLEX AND DOT BELOW | CIRCUMFLEX AND DOT BELOW 

1EC9 1EC8 LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
HOOK ABOVE HOOK ABOVE 

1ECB 1ECA LATIN SMALL LETTER | WITH LATIN CAPITAL LETTER | WITH 
DOT BELOW DOT BELOW 

1ECD 1ECC LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
DOT BELOW DOT BELOW 

1ECF 1ECE LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HOOK ABOVE HOOK ABOVE 

1ED1 1EDO LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
CIRCUMFLEX AND ACUTE CRCUMFLEX AND ACUTE 

1ED3 1ED2 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
CIRCUMFLEX AND GRAVE CIRCUMFLEX AND GRAVE 

1ED5 1ED4 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
CIRCUMFLEX AND HOOK ABOVE | CIRCUMFLEX AND HOOK ABOVE 

1ED7 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
CIRCUMFLEX AND TILDE CIRCUMFLEX AND TILDE 

1ED9 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
CIRCUMFLEX AND DOT BELOW | CIRCUMFLEX AND DOT BELOW 

1EDB LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HORN AND ACUTE HORN AND ACUTE 

1EDD LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HORN AND GRAVE HORN AND GRAVE 

1EDF 1EDE LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HORN AND HOOK ABOVE HORN AND HOOK ABOVE 

1EE1 1EE0O LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HORN AND TILDE HORN AND TILDE 

1EE3 1EE2 LATIN SMALL LETTER O WITH LATIN CAPITAL LETTER O WITH 
HORN AND DOT BELOW HORN AND DOT BELOW 

1EE5 1EE4 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
DOT BELOW DOT BELOW 

1EE7 1EE6 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
HOOK ABOVE HOOK ABOVE 

1EE9 1EE8 LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 


HORN AND ACUTE 


HORN AND ACUTE 


OS/400 globalization 


67 


Lowercase code Uppercase code 


point point Lowercase character description | Uppercase character description 

1EEB 1EEA LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
HORN AND GRAVE HORN AND GRAVE 

1EED 1EEC LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
HORN AND HOCK ABOVE HORN AND HOOK ABOVE 

1EEF 1EEE LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
HORN AND TILDE HORN AND TILDE 

1EF1 1EFO LATIN SMALL LETTER U WITH LATIN CAPITAL LETTER U WITH 
HORN AND DOT BELOW HORN AND DOT BELOW 

1EF3 1EF2 LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
GRAVE GRAVE 

1EF5 1EF4 LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
DOT BELOW DOT BELOW 

1EF7 1EF6 LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
HOOK ABOVE HOOK ABOVE 

1EF9 1EF8 LATIN SMALL LETTER Y WITH LATIN CAPITAL LETTER Y WITH 
TILDE TILDE 

1F00 1F08 GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH PSILI WITH PSILI 

1F01 1F09 GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH DASIA WITH DASIA 

1F02 1FOA GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND VARIA WITH PSILI AND VARIA 

1F03 1FOB GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH DASIA AND VARIA WITH DASIA AND VARIA 

1F04 1FOC GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND OXIA WITH PSILI AND OXIA 

1F05 1FOD GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH DASIA AND OXIA WITH DASIA AND OXIA 

1F06 1FOE GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 

1FO7 1FOF GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 
WITH DASIA AND PERISPOMENI | WITH DASIA AND PERISPOMENI 

1F10 1F18 GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH PSILI WITH PSILI 

1F11 1F19 GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH DASIA WITH DASIA 

1F12 1FIA GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH PSILI AND VARIA | WITH PSILI AND VARIA 

1F13 1F1B GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH DASIA AND WITH DASIA AND VARIA 
VARIA 

1F14 1F1C GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH PSILI AND OXIA | WITH PSILI AND OXIA 

1F15 1F1D GREEK SMALL LETTER GREEK CAPITAL LETTER EPSILON 
EPSILON WITH DASIA AND OXIA | WITH DASIA AND OXIA 

1F20 1F28 GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI WITH PSILI 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1F21 1F29 GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA WITH DASIA 

1F22 1F2A GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND VARIA WITH PSILI AND VARIA 

1F23 1F2B GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND VARIA WITH DASIA AND VARIA 

1F24 1F2C GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND OXIA WITH PSILI AND OXIA 

1F25 1F2D GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND OXIA WITH DASIA AND OXIA 

1F26 1F2E GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 

1F27 1F2F GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND PERISPOMENI | WITH DASIA AND PERISPOMENI 

1F30 1F38 GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH PSILI WITH PSILI 

1F31 1F39 GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH DASIA WITH DASIA 

1F32 1F3A GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH PSILI AND VARIA WITH PSILI AND VARIA 

1F33 1F3B GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH DASIA AND VARIA WITH DASIA AND VARIA 

1F34 1F3C GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH PSILI AND OXIA WITH PSILI AND OXIA 

1F35 1F3D GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH DASIA AND OXIA WITH DASIA AND OXIA 

1F36 GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 

1F37 GREEK SMALL LETTER IOTA GREEK CAPITAL LETTER IOTA 
WITH DASIA AND PERISPOMENI | WITH DASIA AND PERISPOMENI 

1F40 GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH PSILI OMICRON WITH PSILI 

1F41 GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH DASIA OMICRON WITH DASIA 

1F42 1F4A GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH PSILI AND OMICRON WITH PSILI AND VARIA 
VARIA 

1F43 1F4B GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH DASIA AND OMICRON WITH DASIA AND VARIA 
VARIA 

1F44 1F4C GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH PSILI AND OXIA | OMICRON WITH PSILI AND OXIA 

1F45 1F4D GREEK SMALL LETTER GREEK CAPITAL LETTER 
OMICRON WITH DASIA AND OMICRON WITH DASIA AND OXIA 
OXIA 

1F51 1F59 GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 


UPSILON WITH DASIA 


WITH OASIS 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1F53 1F5B GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH DASIA AND WITH DASIA AND VARIA 
VARIA 

1F55 1F5D GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH DASIA AND OXIA | WITH DASIA AND OXIA 

1F57 1F5F GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH DASIA AND WITH DASIA AND PERISPOMENI 
PERISPOMENI 

1F60 1F68 GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITh PSILI WITH PSILI 

1F61 1F69 GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA WITH DASIA 

1F62 1F6A GREEK SMALL LETTER OMEGA | GREEK CAPITAL LETTER OMEGA 
WITH PSILI AND VARIA WITH PSILI AND VARIA 

1F63 1F6B GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA AND VARIA WITH DASIA AND VARIA 

1F64 1F6C GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH PSILI AND OXIA WITH PSILI AND OXIA 

1F65 1F6D GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA AND OXIA WITH DASIA AND OXIA 

1F66 1F6E GREEK SMALL LETTER OMEGA | GREEK CAPITAL LETTER OMEGA 
WITH PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 

1F67 1F6F GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA AND PERISPOMENI | WITH DASIA AND PERISPOMENI 

1F80 1F88 GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND WITh PSILI AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F81 1F89 GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH DASIA AND WITH DASIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F82 1F8A GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND VARIA AND WITH PSILI AND VARIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F83 1F8B GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH DASIA AND VARIA AND WITH DASIA AND VARIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F84 1F8C GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND OXIA AND WITH PSILI AND OXIA AND 
YPOGEGRAMMENI PROSGEGRAMMEN 

1F85 1F8D GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH DASIA AND OXIA AND WITH DASIA AND OXIA AND 
YPOGEGRAMMENI PROSGEGRAMMEN 

1F86 1F8E GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 
AND YPOGEGRAMMENI AND PROSGEGRAMMENI 

1F87 1F8F GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 


WITH DASIA AND PERISPOMENI 
AND YPOGEGRAMMENI 


WITH DASIA AND PERISPOMENI 
AND PROSGEGRAMMENI 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1F90 1F98 GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND WITH PSILI AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F91 1F99 GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND WITH DASIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F92 1F9A GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND VARIA AND WITH PSILI AND VARIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F93 1F9B GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND VARIA AND WITH DASIA AND VARIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F94 1F9C GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND OXIA AND WITH PSILI AND OXIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F95 1F9D GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND OXIA AND WITH DASIA AND OXIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1F96 1F9E GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 
AND YPOGEGRAMMENI AND PROSGEGRAMMENI 

1F97 1FOF GREEK SMALL LETTER ETA GREEK CAPITAL LETTER ETA 
WITH DASIA AND PERISPOMENI | WITH DASIA AND PERISPOMENI 
AND YPOGEGRAMMENI AND PROSGEGRAMMENI 

1FAO 1FA8 GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH PSILI AND WITH PSILI AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1FA1 1FA9 GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA AND WITH DASIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1FA2 1FAA GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH PSILI AND VARIA AND WITH PSILI AND VARIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1FA3 1FAB GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA AND VARIA AND WITH DASIA AND VARIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1FA4 1FAC GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH PSILI AND OXIA AND WITH PSILI AND OXIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1FA5 1FAD GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITH DASIA AND OXIA AND WITH DASIA AND OXIA AND 
YPOGEGRAMMENI PROSGEGRAMMENI 

1FA6 1FAE GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMEGA 
WITh PSILI AND PERISPOMENI | WITH PSILI AND PERISPOMENI 
AND YPOGEGRAMMENI AND PROSGEGRAMMENI 

1FA7 1FAF GREEK SMALL LETTER OMEGA _| GREEK CAPITAL LETTER OMECA 
WITH DASIA AND PEPISPOMENI | WITH DASIA AND PERISPOMENI 
AND YPOGEGRAMMENI AND PROSGEGRAMMENI 

1FBO 1FB8 GREEK SMALL LETTER ALPHA — | GREEK CAPITAL LETTER ALPHA 


WITH VRACHY 


WITH VRACHY 
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Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


1FB1 1FB9 GREEK SMALL LETTER ALPHA | GREEK CAPITAL LETTER ALPHA 
WITH MACRON WITH MACRON 

1FDO 1FD8 GREEK SMALL LETTER IOTA —_| GREEK CAPITAL LETTER IOTA 
WITH VRACHY WITH VRACHY 

1FD1 1FD9 GREEK SMALL LETTER IOTA —_| GREEK CAPITAL LETTER IOTA 
WITH MACRON WITH MACRON 

1FEO 1FE8 GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH VRACHY WITH VRACHY 

1FE1 1FE9 GREEK SMALL LETTER GREEK CAPITAL LETTER UPSILON 
UPSILON WITH MACRON WITH MACRON 

24D0 24B6 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
A A 

24D1 24B7 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
B B 

24D2 24B8 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
Cc Cc 

24D3 24B9 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
D D 

24D4 24BA CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
E E 

24D5 24BB CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
F F 

24D6 24BC CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
G G 

24D7 24BD CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
H H 

24D8 24BE CIRCLED LATIN SMALL LETTER || CIRCLED LATIN CAPITAL LETTER | 

24D9 24BF CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER J 
J 

24DA 24C0 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
K K 

24DB 24C1 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
L L 

24DC 2402 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
M M 

24DD 2403 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
N N 

24DE 2404 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
fo) O 

24DF 24C5 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
P P 

24E0 24C6 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
Q Q 

24E1 24C7 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
R R 

24E2 24C8 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 


‘S) 
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S) 


Lowercase code 
point 


Uppercase code 
point 


Lowercase character description 


Uppercase character description 


24E3 24C9 CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
T T 

24E4 24CA CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
U U 

24E5 24CB CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
V V 

24E6 24CC CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
W WwW 

24E7 24CD CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
X X 

24E8 24CE CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
Y. Y 

24E9 24CF CIRCLED LATIN SMALL LETTER | CIRCLED LATIN CAPITAL LETTER 
Z Z 

FF41 FF21 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTERA LETTERA 

FF42 FF22 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER B LETTER B 

FF43 FF23 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER C LETTER C 

FF44 FF24 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER D LETTER D 

FF45 FF25 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER E LETTER E 

FF46 FF26 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER F LETTER F 

FF47 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER G LETTER G 

FF48 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER H LETTER H 

FF49 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER | LETTER | 

FF4A FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER J LETTER J 

FF4B FF2B FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER K LETTER K 

FF4C FF2C FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER L LETTER L 

FF4D FF2D FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER M LETTER M 

FF4E FF2E FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER N LETTER N 

FF4F FF2F FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER O LETTER O 

FF50 FF30 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 


LETTER P 


LETTER P 
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Lowercase code Uppercase code 

point point Lowercase character description | Uppercase character description 

FF51 FF31 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER Q LETTER Q 

FF52 FF32 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER R LETTER R 

FF53 FF33 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER S LETTER S 

FF54 FF34 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER T LETTER T 

FF55 FF35 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER U LETTER U 

FF56 FF36 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER V LETTER V 

FF57 FF37 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER W LETTER W 

FF58 FF38 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER X LETTER X 

FF59 FF39 FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER Y LETTER Y 

FF5A FF3A FULLWIDTH LATIN SMALL FULLWIDTH LATIN CAPITAL 
LETTER Z LETTER Z 


GB18030: The Chinese standard 
Pa 


GB 18030-2000 is a Chinese standard that specifies an extended code page for use in the Chinese 
market. This code page standard is important for the software industry because the China National 
Information Technology Standardization Technical Committee has mandated that any software application 
that is released for the Chinese market after September 1, 2001, be enabled for GB18030. 


OS/400 supports this encoding with CCSID 1392. Generally, you should use Unicode instead of 1392 for 
complete national language support. CCSID 1392 is provided if you need to handle or interchange 
GB18030 encoded data. See lUnicodd for more information. 


A brief history of major GB code pages 


A common base code page standard for Chinese is GB 2312-1980. It encodes more than 6,000 
frequently-used Chinese ideographs. With the growing importance of Unicode and the parallel standard 
ISO 10646 (which was adopted by China as GB 13000), an extension of GB 2312-1980 was created. This 
extension was called GBK; it encoded all 20,902 unified ideographs that are assigned in Unicode 2.1. GBK 
is not a formal standard, but a widely-implemented specification. 


Unicode 3.0 added more than 6,000 ideographs, and version 3.1 added about 42,000 additional 
ideographs. 


GB 18030 was created as an update of GBK for Unicode 3.0 with an extension that covers all of Unicode. 
It has the following general features: 


¢ GB 18030 character assignments are backwards compatible with the GB 2312-1980 standard and the 
GBK specification. 
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¢ The mapping table between GB 18030 and Unicode is backwards compatible with the one between GB 
2312-1980 and Unicode, and with some exceptions (with the one between GBK and Unicode), most of 
the changes compared to the GBK mapping table are due to updates for Unicode 3.0. 

¢ GB 18030 specifies a mapping table that covers all Unicode code points. It is functionally similar to a 
UTF (Unicode Transformation Format) while maintaining compatibility of GB-encoded text with GBK and 
GB 2312-1980. 


« 
Work with CCSIDs 


This topic describes how the server implements the Character Data Representation Architecture (CDRA). 
Using the server implementation of CDRA, you can achieve consistent representation, processing, and 
interchange of coded characters (data) on OS/400 and across IBM systems. The primary implementation 
of CDRA on OS/400 is through coded character set identifier (CCSID) support. 


¢ Recommendations and guidelines for using CCSIDs 
¢ Change the CCSID of a physical file 

¢ Graphic character (data) sort implementation 

¢ CCSID support for messages 


Recommendations and guidelines for using CCSIDs 

When writing global applications, the following are some recommendations to remember: 

¢ Because the system is shipped with a default CCSID of 65535] character data conversions do not 
normally occur in applications. You should look over the CCSID information in this topic, however, 
because the system may need to participate in a multilingual environment, a network, or exchanging 
data at a later time. 

¢ Applications implementing their own mapping scheme should use CCSID 65535, where a CCSID 
assignment is necessary. For example, depending on what an application does, it might need to use 
CCSID 65535 for the files, or it might need to use CCSID 65535 for the jobs. Because other 
applications may require CCSIDs other than 65535, consider changing such applications by replacing 
the mapping scheme with CCSID support. 

* Correctly define fields based on their usage. If fields contain application-dependent values (for example, 
control characters or fields that are not used as real character fields), define the fields as hexadecimal 
data or character fields with CCSID 65535. 


¢ Avoid using characters that are not in the invariant character set for names and literals in programs. 


Follow these guidelines when using CCSIDs: 


¢ Use CCSIDs in multilingual applications to maintain character integrity in database files, displays, and 
printed data. 


* You can find a suggested CCSID for a language in 


¢ If the QIGC system value is set on, set QCCSID as a mixed CCSID or 65535. For more information on 
QIGC, see IDBGS system indicator (0180) system value. 


* If you use DBCS support, set the job CCSID to a mixed CCSID. If you do not, set the job CCSID to a 
single-byte CCSID. 


¢ Ensure that the QCHRID code page is compatible with the character set and code page of the QCCSID 
value, unless the QCCSID value is 65535. If the QCCSID value is changed to a value that is 
incompatible with the current QCHRID value, the QCHRID value is changed to a compatible value by 
the system. 

¢ If you use a user-defined data stream (UDDS), remove any X’3F’ values inserted by CCSID 
conversions. Otherwise, your data can cause the system to blank out a screen. Some CCSID 
conversions use a X’3F’ value for a substitution character. 
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* lf you are using any interactive jobs, such as Application Development ToolSet/400, ensure that the 
code page of the job CCSID matches the code page of the keyboard type. If these CCSID values do 
not match, or the job CCSID is 65535, unpredictable results could occur. For more information, see 


°* Be aware that the “JOBCCSID eliport is not used by any veto supplied displays or panel groups, 
although CHRIDCTL support is used. 

¢ Be aware of character data that has been defined or specified as control information. For new database 
files, fields that contain control information should be defined as hexadecimal datatype or use CCSID 
65535 instead of another CCSID. 

* Because of workstation hardware restrictions, you may not see all of the characters on displays other 
than 3486, 3487, 3488, or Personal System/2 (PS/2")) displays when CCSID conversion occurs. 
However, the character data is retained in the system. 

¢ Be aware that when a CCSID conversion is performed, substitution characters may_cause a loss of 
data. The situation occurs if enforced subset match conversion is performed (see See ofl 


OS/400 function support for CCSIDs 
The server provides support for CCSIDs in the functions as shown in the following table: 


Function Description of support 


CL commands Some control language (CL) commands have internal 
functions that support CCSID conversions. For more 

information about CL commands that support CCSID 

conversions, see the CL Referencd topic. 


Copy Coded character set identifier (CCSID) support is built 
into the copy function. The Copy File and Copy 
from Query File [cPYEBMOBYE commands support 
CCSIDs. To use the CPYF command to change a 
physical file, see [Cha 


The Copy Source File (CPYSRCF) command Sunpons 
CCSID conversion. 


Database management Database management support provides default coded 
character set identifier (CCSID) values for database files 
on the server. See the Database management topic for 
details. 
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Function 


Description of support 


DDM 


Coded character set identifier (CCSID) support is built 
into distributed data management (DDM). DDM provides 
support to pass CCSID tags in homogeneous 
environments. DDM passes a CCSID parameter when 
sending files. With DDM, you can also specify a CCSID 
when creating files on a remote system. DDM only 
converts data to the job CCSID of the source system 
when: 


¢ The source and target systems are iSeries servers. 


* The source and target systems are at an operating 
system level of Version 2 Release 1.1 or later. 


Program-described files are always created with a CCSID 
of 65535 if they are created: 


* Ona target AS/400‘*) system on a release level from 
Version 2 Release 1.1 through Version 2 Release 3 


* From a source system that is not an iSeries server 


¢ From a source system that is an AS/400 system at a 
release level before Version 2 Release 1.1 


You can use the Submit Remote Command 

command on a source iSeries server to 
change the file CCSID (externally described files only) by 
specifying the command and the CCSID 
parameter. 


DDS 


Coded character set identifier (CCSID) support is built 
into data description specifications (DDS). DDS supports 
file-level and field-level CCSID keywords for all character 
fields in physical files. DDS also supports file-level and 
field-level keywords for all DBCS fields in physical files. 


Distributed relational database 


Coded character set identifier (CCSID) support is built 
into distributed relational database. Distributed relational 
database passes the CCSID of an application requester 
(AR) job to an application server (AS) job and vice versa 
during connect processing. Distributed relational database 
also performs a conversion of error information and 
text-describing fields according to the job CCSID. 


Distributed relational database uses CCSID information to 
determine how to build data exchanged between 
application requester jobs and application server jobs. It 
also uses CCSID information to describe data exchanged 
between application requester jobs and application server 
jobs (for example, a format description). 


IDDU 


Coded character set identifier (CCSID) support is built 
into interactive data definition utility (IDDU). Interactive 
data definition utility provides support to specify a CCSID 
for a character field or a DBCS field. 
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Function Description of support 


Open Query File (OPNQRYF) Coded character set identifier (CCSID) support is built 
into OS/400 query. You can use the Open Query File 

command to specify a CCSID on the 
MAPFLD parameter. The MAPFLD parameter specifies 
the definition of query fields that are either mapped to, or 
derived from, other fields. 


OS/400 query supports CCSID conversion on CHAR, 
OPEN, EITHER, and UCS-2 graphic field operators for 
join, record selection, group-by, and minimum or 
maximum values functions. CCSID conversion is 
performed whenever fields do not have the same CCSID 
value. Once the query is opened, database management 
support converts data read or written to the database files 
as described in the Database management’) topic. 


OS/400 query does not support CCSID conversion if at 
least one of the fields is assigned a CCSID of 65535. 


Query management Coded character set identifier (CCSID) support is built 
into query management. Query management assigns a 
CCSID to queries and forms. Query management: 


* Converts queries to the job CCSID. 


* Presents data to the display device using the job 
CCSID. 


¢ Assigns a CCSID to the files it creates. 


SNA Coded character set identifier (CCSID) support is built 
into SNA Distributed Services (SNADS). SNADS supports 
CCSIDs by any user ID, system name, or destination 
queue name. However, other SNADS services such as 
SNDNETF do not provide CCSID conversion. 


Work management Work management support provides the function to 
assign or change coded character set identifier (CCSID) 
values at three different levels. See the Work 
management topic for details. 


Workstations The workstation management function provides support 
for display files, printer files, and panel groups. See the 
Workstation function management topic for details. 


Database management: Database management support provides default coded character set identifier 
(CCSID) values for database files on the server. All database files are assigned a CCSID. At file creation 
time, the CCSID is either explicitly assigned through DDS, SQL, or IDDU, or it is implicitly assigned the job 
default CCSID (DFTCCSID). 


Database files support for CCSIDs: 

IBM system files and licensed program database files are created with the CCSID of choice for each of the 
national language versions. Only the customer files are automatically assigned the CCSID of the job 
creating the file. You can use the Display File Description (DSPFD) command to view the CCSID of a file. 
Program-described files are assigned CCSID 65535. If a CCSID is not explicitly specified on the CRTPF or 


CRTSRCPF command, database source files default to the job default CCSID at file creation. For more 
information on job default CCSIDs, see Job default coded character set identifier . For a list of 


language identifiers and the DFTCCSID valued associated with those identifiers, see 
oe en Default ccsipd table. 
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If a database logical file is defined over several physical files, it is assigned a CCSID at the field level and 
assumes the CCSID value of the physical file. Logical files cannot be explicitly assigned a CCSID value. 


Database fields and support for CCSIDs: 


Except for numeric database fields, database fields are supported by CCSIDs. You can use the Display 
File Field Description (DSPEED) command to view the CCSID of the fields in a file. 


Hexadecimal fields are assigned CCSID 65535. 


An implicit CCSID value is assigned to the following fields if a CCSID was not explicitly assigned through 
DDS, SQL, or IDDU at file creation: 


¢ Physical-file character 
¢ DBCS-open 

* DBCS-only 

* DBCS-either 

* Graphic 


The implicitly assigned CCSID is the job default CCSID, or a CCSID associated with the job default 
CCSID. 


¢ Acharacter field will be assigned the single-byte character set (SBCS) CCSID that is associated with 
the job default CCSID. 


* ADBCS-open, DBCS-only, and DBCS-either field will be assigned the mixed byte CCSID. 


¢ A Graphic field will be assigned the double-byte character set (DBCS) CCSID that is associated with the 
job default CCSID. 


For example, if the job default CCSID is 5026 (which is a CCSID that identifies mixed data), an SBCS 
character field will be assigned the SBCS CCSID associated with 5026. Thus, the CCSID for that field 
would be 290. If there is no CCSID of the required character set type then a CCSID of 65535 will be used. 
See Job Default Coded Character. Set Identifier (DETCCSID) for more information on job default CCSIDs. 
DS for a list of language identifiers and the 


DFTCCSID valued associated with 086 idenifflets: 


Database logical-file fields are assigned a CCSID value based on their data type and the data type of the 
underlying physical file field. 


Database management and conversion support for CCSIDs: 


Database management support converts non-graphic character data read from, or written to, database 
files using the file CCSID and the job CCSID. 


* If data is being read from a database file and the CCSID of the file is the same as the job CCSID, no 
conversion is done. 

* If data is being read from a database file and the CCSID of the file and the job CCSID are different, the 
data is converted to the CCSID of the job. 

* lf data is being written to a database file and the CCSID of the file is the same as the job CCSID, no 
conversion is done. 


* lf data is being written to a database file and the CCSID of the file and the job CCSID are different, the 
data is converted to match the CCSID of the file. 


No conversion is performed if either the CCSID of the job or the CCSID of the database file is equal to 
65535. 
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Work management: Work management support provides the function to assign or change coded 
character set identifier (CCSID) values at three different levels. All jobs run with a CCSID value 
established at one of these levels: 


* Job level. A CCSID is assigned to a job. 


¢ User profile level. ACCSID is specified in a user profile and the value is assigned _to all jobs run under 
that user profile. The CCSID can be set or changed with the Create User Profile (CRTUSEPRE] and 
Change User Profile commands. 


* System level. The system value QCCSID is the default CCSID for all jobs running on the system. 
QCCSID can be set or changed with the ICHGSYSVAII and bwexsysvall commands. 


Work management support initializes the job CCSID for an interactive job to the CCSID on the user profile 
when the job starts. If *“SYSVAL is specified for the CCSID on the user profile, work management support 
gets the CCSID from the system value (QCCSID). For batch jobs, the CCSID of the current job is used as 
the default CCSID for the submitted job. 


You can change the CCSID of a job by using the Change Job (CHGJOB) command. Make a note of the 
current job CCSID. You can use it later to reset the job CCSID to its original value, if necessary. The new 
CCSID value is reflected in the job immediately. The job DFTCCSID cannot be changed. To retrieve the 
CCSID or DFTCCSID for a job, use the Retrieve Job Attributes (RTVJOBAJ command or the Retrieve Job 
Information application programming interface (API). Interactively, use the Work with Job 
command and select the Display Job Definition Attributes option on the Work with Job display. 


Workstation function management: Workstation function management involves working with: 
* Display files 

* Printer files 

* Panel groups 


All source files on the system are tagged with a coded character set identifier (CCSID). 


Display files: \When a display file object is created, it is tagged with the coded character set identifier 
(CCSID) of the source file. At compile time: 


¢ All character data is read from the primary source file without any character conversion being 
performed. 


¢ User message text (identified by the MSGCONI keyword in DDS) remains the same because it is 
assumed to be in the same CCSID as the primary source file. 


At run time, the constant data is converted based on the CHRID parameter value used to create the 
display file object. This conversion is optional and can occur only when the CHRID is set to *JOBCCSID or 
indirectly with (CHRIDCTU. This conversion is from the display file CCSID to the character identifier 
(CHRID) of the device. The field-level keyword NOCCSID (no coded character set identifier) allows the 
user to specify fields within the DDS that are never to be converted. 


Note: To use data management support of CCSIDs, you must change source physical files tagged with 

CCSID 65595 to a CCSID value that is associated with the data. See Changing the CCSIN of a physical 
for more information. 

CHRID parameter on the Create Display File command 


The CHRID parameter on the Create Display File [(CRTDSPF) command affects the conversion that 
occurs for the display file. 


If the *JOBCCSID value is specified on the CHRID parameter of the CRTDSPF command: 
¢ Input characters are converted from the device character identifier (CHRID) to the job CCSID. 
¢ Character data is sent to output-capable fields and converted from the job CCSID to the device CHRID. 
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* Constant text from the display file is converted from the CCSID of the display file to the CHRID of the 
device. 

* All message files are tagged with a CCSID. Message text is converted from the CCSID of the message 
file to the CHRID of the device. When message files are tagged with a CCSID of 65535 (the system 
default), it is assumed that the contents of the message files are already in the CHRID of the device. To 
ensure appropriate conversions occur, you can enable CCSID support for messages. See 
Seen ae more information on enabling CCSID support for messages. 

* Message replacement data is converted from the CCSID of the job, or from the CCSID of the display 
file, to the CHRID of the device. 

* All status messages that are tagged with a CCSID other than 65535 are converted to the CHRID of the 
device. 


¢ Message text for messages on a message line or in a message subfile (identified by the 
ERRMSGID] and keywords in DDS) is converted from the message file CCSID to 
the device CHRID. 


If a specific value is specified for the CHRID parameter on the CRTDSPF command, conversion is done 
between the CHRID specified on the CRTDSPF command and the CHRID of the device. This conversion 
affects only fields defined with the CHRID DDS keyword. 


If the *DEVD value is specified on the CHRID parameter of the CRTDSPF command, no conversion is 
performed. This is the default setting. 


Migration of display files with CCSID 65535 


All source files in Version 3 of the OS/400 licensed program have an implicit CCSID value of 65535. To 
have appropriate CCSID support, display files must be recompiled with a source file that has a CCSID 
value other than 65535 if either of the following are true: 


¢ The display file was originally compiled from a source file with a CCSID value of 65535. 


* The display file was originally compiled prior to Version 2 Release 3 Modification 0 of the OS/400 
licensed program. 


By recompiling, the display file object is tagged and all necessary conversions take place when needed. 
No conversions take place if the source files are explicitly tagged CCSID 65535. 


Printer files: When a printer file object is created, it is tagged with the coded character set identifier 
(CCSID) of the source file. Processing of the source files for printer files is the same as for display files. At 
compile time, all character data is read from the primary source file without any character conversion being 
performed. 


When aa to the device, if the *JOBCCSID value is specified on the CHRID parameter of the 
command: 


* Constant text from an externally described printer file is converted from the CCSID of the printer file to 
the CCSID of the job. 


¢ Character data sent to output fields is assumed to be already converted to the job CCSID. 


If the printer data stream is tagged with the character identifier (CHRID) derived from the CCSID of the 
job, the CHRID value is used by the printer to interpret the data. The CHRID value is ignored for printers 
not supporting this function. 


If a specific value is set for the CHRID parameter on the CRTPRTF command: 


* For externally described printer files, fields that specify the CHRID DDS keyword use the CHRID value 
specified on the printer file. The remainder of the file is printed as if *DEVD was specified for the CHRID 
parameter on the CRTPRTF command. 
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¢ For program-described printer files, the printer data stream uses the CHRID value specified on the 


printer file. 


If the *DEVD parameter is specified on the CHRID parameter of the CRTPRTF command, no conversion 


is performed. 


The CHRID information is determined by either the printer hardware or by the device description. If the 
CHRID information is obtained from the device description, it is then sent to the printer. 


User interface manager menus and panel groups: Like display files and printer files, panel group objects 
and user interface manager (UIM) menus are tagged with the CCSID of the primary source file. The 
contents of embedded source members are converted to this CCSID. When the panel group or UIM menu 
is created with *JOBCCSID specified for the CHRID parameter, conversion is performed at run time. 
Conversion is performed between the CCSIDs of the panel group or menu, the job, and the CHRID of the 


display or printer. 


CCSID conversions of user interface manager menu and panel groups 


The following CCSID conversions occur for displays of panel groups and UIM menus: 

* Text in the panel group is converted from the panel group CCSID to the device CHRID. 

¢ Text in the UIM menu is converted from the UIM menu CCSID to the CHRID of the device. 
* Variables from the user job are converted from the job CCSID to the device CHRID. 

¢ Variables from the job are converted from the CHRID of the device to the job CCSID. 


¢ Online help information imported from a different panel group is converted from the imported panel 
group CCSID to the device CHRID. 


CCSID conversions when printing UIM menus and panel groups 


CCSID conversions for printed UIM menus and panel groups are shown in the following table. In this table, 
Xxx and yyy are explicitly assigned CCSID values. For example, a printer file CHRID is explicitly assigned 
a value of 00697 00037. The panel group is set to *JOBCCSID. The panel group constant text is 
converted from the panel group primary source file tagged with CCSID 00500 to the printer file CHRID 


00697 00037. 


Printer file CHRID is 


And the panel group or 
menu CCSID is xxx 


or *JOBCCSID 


or *DEVD 


yyy No conversion occurs for Panel group constant text is | No conversion occurs for 
panel group constant text. | converted from panel group | panel group constant text. 
primary source file CCSID 
to yyy. 
Variables with Variables with No conversion occurs for 
CHRID=PNLGRP on class |CHRID=PNLGRP on class_ | variables with 
tag are converted from xxx | tag are converted from job |CHRID=PNLGRP on class 
to yyy. CCSID to yyy. tag. 
No conversion occurs for Variables without No conversion occurs for 
variables without CHRID=PNLGRP on class_| variables without 
CHRID=PNLGRP on class_ | tag are converted from job |CHRID=PNLGRP on class 
tag. CCSID to YYY. tag. 
*JOBCCSID No conversion occurs for Panel group constant text is | Panel group constant text is 


panel group constant text. 


converted from panel group 
primary source file CCSID 
to job CCSID. 


converted from panel group 
primary source file CCSID 
to job CCSID. 
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Printer file CHRID is 


And the panel group or 
menu CCSID is xxx 


or *JOBCCSID 


or *DEVD 


Variables with 
CHRID=PNLGRP on class 
tag are converted from XXX 
to job CCSID. 


No conversion occurs for 
variables with 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables with 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables without 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables without 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables without 
CHRID=PNLGRP on class 
tag. 


*DEVD 


No conversion occurs for 
panel group constant text. 


Panel group constant text is 
converted from panel group 
primary source file CCSID 
to job CCSID. This 
conversion occurs because 
variables are in the job 
CCSID and the device 
CHRID is unknown. 


No conversion occurs for 
panel group constant text. 


No conversion occurs for 
variables with 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables with 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables with 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables without 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables without 
CHRID=PNLGRP on class 
tag. 


No conversion occurs for 
variables without 
CHRID=PNLGRP on class 
tag. 


Change the CCSID of a physical file 
You can use the Change Physical File (CHGPE)) command to change the coded character set identifier 


(CCSID) of a physical file. 


However, a physical file cannot be changed if one or more of the following conditions exist when working 

with a logical file defined over a physical file: 

* The logical file has a sort sequence table associated with the CCSID of the physical file and the CCSID 
you want to change to is incompatible. That is, conversion between the original CCSID and the CCSID 
you want to change to is not allowed because all the characters of the original CCSID are not in the 


new CCSID. 


* The logical file has a sort sequence table associated with the CCSID of the physical file and the CCSID 
you want to change to is incompatible. Additionally, the logical file has fields defined with CCSIDs that 
are not compatible to the new CCSID you want to change the physical file to. Again, conversion 
between the original CCSID and the CCSID you want to change to is not allowed because all the 
characters of the original CCSID of the logical file or the fields with specific CCSIDs are not in the new 


CCSID. 


* Aselect/omit or join logical file, or both that performs select/omits or joins between physical file fields 
that have different CCSIDs. 


* A join logical file with a sort sequence table such that the CCSID of the logical file’s secondary access 
path is different than the CCSID to which the physical file is being changed. 


Graphic character (data) sort implementation 
The following links describe the OS/400 implementation of sorting, or sequencing, characters (data). The 
server allows you to customize the sequence in which single-byte and graphic characters are sorted. You 
can customize the sorting sequence of a set of characters using a sort sequence table. 
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If your application uses locales, you can use the sorting support provided by the LC_COLLATE] locale 
category. 


Use the following links to find additional information about character graphic sort implementation: 


For more information 


See Kart sequences! 


Sort sequence types: A set of shared-weight and unique-weight sort sequence tables for SBCS 
languages is provided on servers. A shared-weight sequence is a sort sequence in which some graphic 
characters may have the same weight as some other characters in the sequence. Those with the same 
weight sort together as though they were the same character. For example, the letters a and A might both 
have the same value 24. This ensures that words such as able and Able are kept together in a list. In a 
simple sort table, a and A might share the value 24, and b and B might share the value 25 and so on. 


A unique-weight sequence is a sort sequence in which each graphic character has a weight different from 
the weight of every other graphic character in the sequence. 


Sort sequence scenarios: The following table shows characters you can sort using a binary, a 
shared-weight, and a unique-weight sort sequence for the Danish code page 00277. 


Code point in | Shared sort Unique sort 
Character name Character illustration code page 277 | weight weight 
AE ligature & X’7B’ 96 183 
O slash 6 X’7C’ 97 187 
A overcircle A X’5B’ 98 191 
N _— 
Latin capital N X’D5 83 132 
: Zz, — 
Latin capital Z X’E9 95 181 
O umlaut O X’EC’ 97 189 
A or 
Latin capital A x’C1 70 77 


Using the information in the previous table, the characters are sorted in ascending order as shown in the 
following table. 


Position in ascending order Binary sort Shared weight sort Unique weight sort 
First A overcircle Latin capital A Latin capital A 
Second AE ligature Latin capital N Latin capital N 

Third O slash Latin capital Z Latin capital Z 
Fourth Latin capital A AE ligature AE ligature 
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Position in ascending order Binary sort Shared weight sort Unique weight sort 
Fifth Latin capital N O umlaut O slash 

Sixth Latin capital Z O slash O umlaut 

Seventh O umlaut A overcircle A overcircle 


The following table shows an example of a shared-weight sort sequence, a unique weight sort sequence, 
and the binary sort sequence for English code page 00037. 


Binary sort sequence 


Shared-weight sort sequence using 
LANGID(ENU) and 
SRTSEQ(*LANGIDSHR) 


Unique-weight sort sequence using 
LANGID(ENU) and 
SRTSEQ(*LANGIDUNQ) 


Jones, Mary JOHNSON, JOHN JOHNSON, JOHN 
JOHNSON, JOHN JONES, MARTIN Jones, Mary 
JONES, MARTIN Jones, Mary JONES, MARTIN 
Smith, Ron SMITH, ROBERT Smith, Ron 
SMITH, ROBERT Smith, Ron SMITH, ROBERT 


Sort sequence support: The sort sequence support is provided in the following OS/400 functions. 
* A.user interface for creating new tables based on system-supplied sort sequence tables 

¢ The Work with Tables [(WRKTBLII command for creating and displaying tables 

* The Create Table (CRTTBL] command for creating tables 

¢ CL, ILE RPG IV, and ILE COBOL for compilers. 

¢ Program support 

¢ Database management support 

¢ Other system components support 


Sort sequence support in programs: You can assign sort sequences to programs used for ordering and 

comparing data. You assign a sort sequence to a program by specifying the sort sequence to be used at 

compile time. Specify the sort sequence to be used with the sort sequence (SRTSEQ) parameter and 

language identifier (LANGID) parameters of the create program commands. Valid SRTSEQ parameter 

values are: 

¢ SRTSEQ(*HEX) means that no sort sequence should be used (hexadecimal sorting). 

* SRTSEQ(*LANGIDUNQ) or SRTSEQ(*LANGIDSHR) means that the unique- or shared-weight sort 
sequence, determined by the LANGID parameter, should be used. 

* Aname for the system-supplied or user-supplied sort sequence name can be specified explicitly on the 
SRTSEQ parameter. If you explicitly specify a sort sequence name, the LANGID parameter is ignored. 

¢ SRTSEQ(*JOB) or LANGID(*JOB) means that the sort sequence to be used is determined by the value 
associated with the job when the program is created. 

* SRTSEQ(*JOBRUN) or LANGID(*JOBRUN) means that the sort sequence to be used is determined by 
the values from the job when the program is run. 


The first three options assign the sort sequence to the program object at creation time. This sequence is 
always used when the program is run. Using the *JOBRUN value on the SRTSEQ or LANGID parameters, 
however, provides the possibility for dynamically assigning sort sequence to the program. 

Sort sequence support in work management: Work management involves the assigning of the SRTSEQ 
value at the job level, the user profile level, and the system value level. 
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Sort sequence support at the job level: A sort sequence (SRTSEQ) value is assigned to a job. It is valid 
on the Submit Job (SBMJOB), Batch Job (BCHJOB), and the Change Job (CHGJOB) commands. If a 
program is created with SRTSEQ(*JOB), the sort sequence is set from the job sort sequence. If a program 
is created with SRTSEQ(*JOBRUN), the sort sequence is set from the job sort sequence at run time. 


Sort sequence support at the user profile level: The user profile assigns a SRTSEQ value to a user 
and, by default, to all jobs running under this user profile. The user profile SRTSEQ value defaults to the 
sort sequence system value (QSRTSEQ). 


Sort sequence support at the system value level: The QSRTSEQ system value defines a sort 
sequence that can be referred to by other objects. The QGRTSEQ system value should be set according 


to the oe of the oer analiape used on the system. For more information on QGRTSEQ, see 


Sort sequence support in database management:_ Database management supports the SRTSEQ and 
LANGID parameters on the Create Physical File ({CRTPE)) and Create Logical File (CRTLE) commands. 


The LANGID and SRTSEQ parameters determine a sort sequence table. The sort sequence table is 
captured at file creation time and is stored as an attribute of the file. The SRTSEQ job attribute has no 
effect on the processing of an existing database file. The sort sequence table associated with the file is 
used for key sequencing, select logic fields and omit logic fields, and for join field functions. 


The ALTSEQ keyword in DDS can also be used to specify a sort sequence table. The ALTSEQ keyword 
applies only to the key fields, not to the select logic fields and the omit logic fields. If the SRTSEQ 
parameter is specified on the CRTPF command or the CRTLF commands and the ALTSEQ keyword in the 
DDS source file specify a sort sequence table, an error message is sent and the file is not created. 


The default SRTSEQ parameter on CRTPF and CRTLF commands is *SRC, which indicates that the sort 
sequence table on the ALTSEQ keyword should be used. If ALTSEQ is not used in DDS, the SRTSEQ 
attribute of the job determines the file attributes when creating or changing the file. 


How sort sequences are specified for database management 


Sort sequence tables can be specified in the following areas: 

* Query for iSeries support 
External sort sequence tables (including those shipped with the system) and user-defined tables can be 
specified. 

* DB2 Query Manager and SQL Development Kit for iSeries 


The Create Structured Query Language xxx (CRTSQLxxx) commands and the Start Structured Query 
Language command support the SRTSEQ and LANGID parameters. 


A sort sequence table can be specified when a query object is being defined with the Work with Queries 
display. The sort sequence (SRTSEQ) value and language identifier (_LANGID) value are specified on 
the Specify Sort Sequence display. 


¢ DB2 UDB for iSeries Query Management 


The Create Query Management Query (CBTQMORY) command supports the SRTSEQ and LANGID 
parameters. 


For more information on sort sequence support for database programming, see the [IDB2 UDB fod 
topic. 


Sort sequence support in other system components: Sort sequence support is found in the following 
components of the system: 


° (Create Control Language Program) command 
The LANGID and SRTSEQ parameters are supported. 
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- [DSPPGM (Display Program) command 
The LANGID and SRTSEQ values that were specified when the program was created are displayed. 
- CRTDSPH (Create Display File) command 
The LANGID and SRTSEQ parameters are supported. The values of the RANGE, VALUES, and COMP 
keywords are validated when the display file is compiled. 
* High-level languages 
Using ILE COBOL and ILE RPG IV languages, you can specify SRTSEQ and LANGID values directly 
on the Create Bound Program (CRTBNDXXX) commands. Original Program Model RPG and COBOL 
compilers use the Create Program (CRTXXXPGM) commands. With ILE C, you can also specify 
SRTSEQ and LANGID values when you create a locale. You can then associate the locale with a 
program. 


The transfer function allows a sort sequence table to be specified when performing queries on database 
files and SQL tables. 


Sort sequence tables: A sort sequence table is an object that contains the weight of each single-byte 
graphic character within a specified coded character set identifier (CcsIp}. The system-recognized 
identifier for the sort sequence table object type is *TBL. 


Depending on your requirements, you can define a table to have either a unique weight for each graphic 
character or shared weights for some graphic characters. If you define a table that contains unique 
weights for each character within the character set, your table is known as a unique-weight table. If you 
define a table that contains some graphic characters that share the same weight, your table is known as a 
shared-weight table. For example, if you want to sort the graphic character capital letter A and the graphic 
character small letter a together, you would define a shared-weight table. If you want to sort these graphic 
characters separately, you would define a unique-weight table. 


A set of sort sequence tables is shipped with the servers. This set of tables defines both unique- and 
shared-weight sort sequences for all SBCS languages. 


Sort sequence table implementation notes 


Sort sequence support does not take into consideration the following: 


* Special cases of single characters that should be handled as multiple characters (such as the German 
character s sharp). 


* Sequences of characters that should be treated as a single character (such as the Danish aa, 
Hungarian ly, Serbian lj, Spanish Il). 


¢ Nonalphanumeric characters that should be ignored because they are embedded in alphanumeric 
strings (such as the hyphen in co-op). 


¢ Prefixes that should be ignored (such as Van der in the name Van der Pool). 
¢ Program-described files. 
¢ DBCS code pages. 


If a sort sequence table has a weight other than hexadecimal 40 assigned to the blank character, 
unpredictable results can occur when strings of unequal lengths are compared. 


Sort sequence tables shipped with the system 


You can use the WRKTBL command to view the contents of the sort sequence tables that are shipped 
with OS/400. The tables are located in the QSYS library. 


When looking at these tables, consider the following: 
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* Several tables shipped with the system represent a single sort sequence, each encoded with a different 
coded character set identifier (CCSID) value. Not all of the characters in a given sort sequence exist in 
every CCSID in which the sort sequence is encoded. 


¢ Use the language identifier (LANGID) parameter and the sort sequence (SRTSEQ) parameter to access 
the unique-weight tables (*“LANGIDUNQ) or the shared-weight tables (*LANGIDSHR). 


* When using the sort sequence, the relative weights shown in these tables differ from the actual weights 
in the sort sequence table on the system. The relative weights shown in these tables are examples only. 


* The relative unique weight of a character is shown by the order of the characters in the sort sequence 
table. The relative unique weight is determined by assigning a weight of 1 to the first character in the 
sort sequence table and incrementing by 1 for each of the following characters until the end of the table 
is reached. 


¢ GCGID is the graphic character global identifier. 


For example, the lArabid sort sequence table shows the relative sort sequence weights for characters that 
are sorted using the Arabic sort sequence table. 


How to build sort sequence tables 


To create a user-defined sort sequence table, copy an existing sort sequence table using the Create Table 
command, and then modifying the copy of the table. Table functions allow you to do the 
following: 


¢ Use a definition stored in a source member. 
* Create a table based on another sort sequence table using an interactive interface. 


You can create a sort sequence table (MYTEST) from a copy of an existing table using the following 
CRTTBL command: 


CRTTBL TBL(MYTEST) SRCFILE(*PROMPT) TBLTYPE(*SRTSEQ) 
BASESRTSEQ(QSYS/QLA10025S) CCSID(037) 


This command displays a sort sequence table that you can modify. Your table is created from a function 
key on this display. Your resulting table has a coded character set identifier (CCSID) value of 00037. The 
table is named MYTEST and is stored in the current library. 


The following table shows one way in which the resulting characters may be shown on the first display of 
the MYTEST sort sequence table. The actual panel shows characters instead of text descriptions. For 
example, the character shown for sequence 0100 would be a question mark (?), and the character shown 
for sequence 0070 would be a colon (:). 


Note: The characters that you actually see on the first display of the MYTEST sort sequence table may 
vary, depending on the device that you use. 


Sequence Character 

0010 Equal sign 

0020 Overline 

0030 (SHY) 

0040 Hyphen 

0050 Comma 

0060 Semi-colon 

0070 Colon 

0080 Exclamation mark 

0090 Inverted exclamation mark 
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Sequence Character 

0100 Question mark 

0110 Inverted question mark 
0120 Slash 

0130 Period 

0140 Acute accent mark 
0150 Grave accent mark 
0160 Caret 

0170 Right square bracket 
0180 Tilde 

0190 Small multiply dot 
0200 Comma 


You can make changes to the tables to move characters in each code page to the preferred position for 
the national language sort sequence table. The ordering is done by increments of 10. Therefore, the first 
value is 10, then 20, and so on. If some characters have a shared weight, these groups of characters 
have the same sequenced weight. 


CCSID support for messages 

You can use CCSID support for handling messages and message catalogs on OS/400. You can send 
messages tagged with one CCSID to users with a different CCSID. You can use CCSID support to handle 
messages by using commands and application programming interfaces. 


Note: You do not need a multinational character set (MNCS) when using CCSIDs for handling messages. 


For example, if you do not set CCSID support on, the following message, encoded in CCSID 00037: 
Joe, I need to see you right away! 


appears to a user with CCSID 00500 as 
Joe, I need to see you right away] 


Instead of seeing an exclamation mark (!), Joe sees a right square bracket (J). If you set CCSID support 
on, the text in a message encoded in CCSID 00037 is converted to CCSID 00500. Both the person 
sending the message and the person receiving the message see identical text. 


CCSID support helps preserve data integrity in messages. As you read through this information, you will 
see other advantages to using CCSID support for messages. 


(bject-level CCSIDs 


° Message files 
— Message-level support 
— Message description-level support 
* Message queues 
¢ Job message queues 
¢ System reply lists 
* History log 


The following topics provide detailed information about message support: 
¢ Setting up CCSID support for message handling 
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¢ CCSID support for message catalogs 

* Converted character replacement data type field 

¢ Retrieve messages 

* Receive messages 

* Common questions about CCSID support for message handling 


For more information 
See Message catalogs for a description of general OS/400 globalization support for messages. 


The following message handling commands support CCSIDs: 
: (Create Message File) 

- [cRTusGd (Create Message Queue) 

- CcHGMsGd (Change Message Queue) 

- [ADDRPYLA (Add Reply List Entry) 

» (CCHGRPYLA (Change Reply List Entry) 

- (cHGuMsGcpI (Change Message Description) 
- [rvusG (Retrieve Message) 

- cvusa (Receive Message) 

- [SNDBRKMSG] (Send Break Message) 

- SNDusG (Send Message) 

- ISNDPGMusG (Send Program Message) 

° ISNDRPY| (Send Reply) 

- ISNDUSRMSG (Send user Message) 


Handle messages with a specific object-level CCSID: The following objects support CCSIDs: 
* Message files 

* Message queues 

* Job message queues 

¢ System reply lists 

* History log 


Each of these objects has an object-level CCSID. The object-level CCSID is the CCSID in which all the 
messages in that object are encoded. 


See the following topics for details about object-level CCSIDs: 
* Object-level coded character set identifiers 65535 and 65534 
¢ Using a specific object-level CCSID for handling messages 


Object-level coded character set identifier 65535: CCSID 65535 is the default object-level CCSID for 
message files and message queues. If an object has a CCSID of 65535, no conversions occur when 
adding messages to that object or when receiving messages from that object. Use CCSID 65535 if you do 
not want CCSID processing to occur. 


CCSID 65535 is also known as *HEX. 
Object-level coded character set identifier 65534: CCSID 65534 is the default object-level CCSID for job 
message queues, system reply lists, and the history log. If the CCSID of an object is 65534, each 


message in the object has its own CCSID. No conversion occurs when a message is added to the object. 
When a message is received, it is converted based on the CCSID stored with the message. 
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CCSID 65534 is also known as *MSG or *MSGD. 


CCSID 65534 is the preferred setting for object-level CCSIDs. An object-level CCSID of 65534 requires 
fewer CCSID conversions. Fewer CCSID conversions of text result in better performance and improved 
data integrity. 


Using a specific object-level CCSID for handling messages: \f the CCSID of an object is any value other 

than 65535 or 65534, all messages in that object are considered encoded in that CCSID. The object-level 

CCSID overrides the CCSID stored with the messages. Use this type of object-level CCSID if both of the 

following are true: 

* You expect the object to be sent messages or have message descriptions added in a CCSID different 
from the CCSID in which you would receive the messages or retrieve the message descriptions. 


* You intend to receive the same message or retrieve the same message description many times. 


If these conditions are true, set the object-level CCSID to the CCSID in which you will receive or retrieve 
the messages. When the system uses this type of object-level CCSID, the message text or data is 
converted at the time the message is sent or is added to the object. No conversion occurs when the 
message is received or retrieved because the text and data are already in the CCSID requested on the 
receive operation or retrieve operation. 


Do not change system-supplied message files to use this type of object-level CCSID. Each 
system-supplied message description is tagged separately. No one object-level CCSID value can 
represent all of the message descriptions in the message file. Changing the object-level CCSID of a 
system-supplied message file to anything other than CCSID 65535 or CCSID 65534 may cause 
unpredictable results. 


Message-level support: When a message is sent to a message queue, you must communicate the 
CCSID of the replacement data or the impromptu message text to the operating system. Use the CCSID 
parameter on any of the send message commands or APIs to communicate this CCSID to the operating 
system. 


The default CCSID setting in the send message commands and APIs indicate that the replacement data or 
impromptu message text is in the CCSID of the job that is running the command or API. You can override 
the job default CCSID value by specifying a different CCSID value. 


If the replacement data or impromptu message text supplied is not in the CCSID specified, incorrect 
conversion results may occur. See Sean ESTOS ST OEE, if this occurs. 
Determining the CCSID of a message file 


To determine the CCSID of a message file, type: 
WRKMSGD MSGF (MYLIB/MYMSGF) 


where MYLIB is the library in which the message file is stored and MYMSGF is the name of the message 
file. 


Next, press F22 (Display list details). 


You can also use the QMHRMFAT (Retrieve Message File Attributes) application program interface (API) 
to determine the CCSID of a message file. 


For job message queues, system reply lists, and the history log, the object-level CCSID is always 65534. 


You cannot change nor display object-level CCSIDs for job message queues, system reply lists, and the 
history log. 
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How the message-level CCSID is set 


See the following topics for information on how the message-level CCSID is set: 
¢ Message-level CCSID with a message queue CCSID of 65535 or 65534 

¢ Message-level CCSID with a specific message queue CCSID 

¢ Message-level CCSID when a message queue CCSID conversion error occurs 
¢ Message-level CCSID when a message is a stored message 


Message-level CCSID with a message queue CCSID of 65535 or 65534: When a message is sent to the 
message queue and the CCSID of the message queue is 65535 or 65534, no conversion occurs on the 
message. The message-level CCSID is set to the CCSID specified. 


For example, message queue MYMSGQ has a CCSID of 65534. You enter the following Send Message 
command: 


SNDMSG MSG('MSG #1') CCSID(37) TOMSGQ(MYLIB/MYMSGQ) 


The impromptu message text, MSG #1, is not converted when added to the message queue. The 
message is tagged with CCSID 00037. 


Message-level CCSID with a specific message queue CCS/D: When a message is sent to the message 
queue and the CCSID of the message queue is something other than 65535 or 65534, the replacement 
data or impromptu message text is converted to the CCSID of the message queue. The message is then 
tagged with the CCSID of the message queue. 


For example, message queue MYMSGQ has a CCSID of 00277. The replacement data for TST0002 is 
defined as *CCHAR data. You enter the following Send Program Message command: 


SNDPGMMSG MSGDTA(X'0006D4E2C7407BF2') MSGID(TSTO002) MSGF(MYMSGF) 
CCSID(37) TOMSGQ(MYLIB/MYMSGQ) 


The replacement data is converted from CCSID 00037 to CCSID 00277 before it is sent to the message 
queue. X’0006’ is the length required for variable-length fields. X’D4E2C7407BF2’ is MSG #2 on code 
page 00037. The number sign (#), X’7B’ on code page 00037, is converted to a number sign, X’4A’ on 
code page 00277. All other code points do not change during the conversion because they are the same 
on both code page 00037 and code page 00277. 


When the replacement data or impromptu message text of a message is 65535 and it is sent toa 
message queue with a CCSID other than 65535 or 65534, no conversion occurs. However, the message 
is tagged with the CCSID of the message queue. Therefore, messages can be tagged with an incorrect 
CCSID when you send them to a message queue with a CCSID that overrides the message-level CCSID. 


For example, message queue MYMSGQ has a CCSID of 00277. You enter the following Send Message 
command: 


SNDMSG MSG('MSG #2') TOMSGQ(MYLIB/MYMSGQ) CCSID(*HEX) 


The impromptu message text MSG #2 is not converted before it is sent to the message queue. Although 
the impromptu message text is not converted to CCSID 00277, it is displayed using CCSID 00277. Unless 
you entered the Send Message command from a device configured to support code page 00277, you lost 
the integrity of the impromptu message text. 


Message-level CCSID when a message queue CCSID conversion error occurs: \f a conversion error 
occurs while sending a message to a message queue, the message is still sent to the message queue. 
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However, the impromptu text or data of the message is not converted. A diagnostic message is sent and 
the message is tagged with the message-level CCSID specified on the send command or API, not with the 
CCSID of the message queue. 


You can recover the replacement data or impromptu message text with the proper CCSID setting. First, 
set the message queue CCSID to 65534. Then use the Receive Message command or API to return the 
correct message-level CCSID. 


Message-level CCSID when a message is a stored message: lf a message is a stored message, the 
message-level CCSID applies only to *CCHAR replacement data. The CCSID of the first- and second-level 
text of the message is retrieved from the message file. 


Replies to stored messages are never converted from one CCSID to another. Only replies to impromptu 
messages are affected by CCSID processing. 


Message description-level support: \When a message description is added to a message file, the 
CCSID of the i ar text must be communicated to the operating system. You can use the CCSID 
parameter on the or the command to communicate this CCSID to the operating 
system. 


The default settings of these commands indicate that the message text is in the CCSID of the job that is 
running the command. You can change this value by specifying a different CCSID value. You can also 
change this value by indicating that no CCSID processing should occur. You indicate that no CCSID 
processing should occur on the message text by specifying a CCSID value of 65535 (*HEX). 


If you set CCSID processing on, system-supplied display files and printer files that display or print 
message descriptions convert the CCSID of the message file to the CCSID of the job before displaying 
them or printing them. To print and display the messages correctly, your job CCSID setting must be the 
same as the code page portion of your device CHRID setting. 


All message descriptions that existed in a message file that was created prior to V3R1 are tagged with 
CCSID 65535 on the first use or handling of that message description. 


If the text of a message is not in the CCSID specified, incorrect conversion results may occur. See [Can I 
e if this occurs. 


CO Ne 
How the CCSID of a message description is set 


To set the message description-level CCSID, consider the following topics: 
* Message file with a CCSID of 65535 or 65534 
* Message file with a specific CCSID 


How to change the CCSID of a message description 

See Changing the CCSID of a message description for details. 

Message file with a CCSID of 65535 or 65534: \f the CCSID of the message file is 65535 or 65534, no 
conversion occurs on the message description when it is added to the file. The message description 


CCSID is set to the CCSID specified on the ADDMSGO or command. 


For example, a message file MYMSGF has a CCSID of 65534. The job that is running is in CCSID 00037. 
You enter an ADDMSGD command, as follows: 


ADDMSGD MSG('MSG #1') MSGID(TSTO0@01) MSGF (MYMSGF) 
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The message text, MSG #1, is not converted when added to the message file. The message text is tagged 
00037 because the CCSID parameter was not coded on the ADDMSGD command and the default CCSID 
parameter is *JOB. 


Message file with a specific CCSID: \f the CCSID of the message file is something other than 65535 or 
65534, the first- and second-level text of the message description is converted from the CCSID specified 
to the CCSID of the message file. It is then tagged with the CCSID of the message file. 


For example, message file MYMSGF has a CCSID of 00277. The job that is running is in CCSID 00037. 
You enter the following command: 


ADDMSGD MSG('MSG #2') MSGID(TST0002) MSGF(MYMSGF) CCSID(37) 


Message ’MSG #2’ is converted from CCSID 00037 to CCSID 00277 before it is added to the message 
file. The number sign (#), X’7B’ on code page 00037, is converted to the number sign (#), X’4A’, on code 
page 00277. No other code points change during the conversion because they are the same on both code 
page 00037 and code page 00277. 


When the text of a message description is specified as 65535 and it is added to a message file, no 
conversion occurs. If the CCSID of the message file is not 65535 or 65534, the message text is tagged 
with the CCSID of the message file. 


When the message file CCSID is not 65535 or 65534, the message file CCSID overrides message 
description CCSIDs. Keep this rule in mind when adding and changing message descriptions to a 
message file with a CCSID other than 65535 or 65534. Otherwise, a message description can be marked 
incorrectly. 


For example, message file MYMSGF has a CCSID of 00277. You enter the following command: 
ADDMSGD MSG('MSG #2') MSGID(TSTOQ02) MSGF(MYMSGF) CCSID(*HEX) 


Message text ’MSG #2’ is not converted before it is added to the message file. Because the CCSID of the 
message file is 00277, the message text is tagged with CCSID 00277. 


If the command was run in a job CCSID where the number sign (#) occupies a code point different than 
the code point for the number sign on code page 00277, the message is displayed incorrectly. 


A conversion error may occur while adding or changing a message description in a message file. If a 
conversion error occurs, the message description is still either added to or changed in the message file. 
The text of the message description, however, is not converted. A diagnostic message is sent and the 
message description is tagged with the CCSID specified, not with the CCSID of the message file. 


When a conversion error occurs, you can recover the correct CCSID tagging for the message description 
by setting the message file CCSID to 65534. Then you can retrieve the correct CCSID for the message 
description using the Retrieve Message command or the Retrieve Message (QMHBTvM) API. 


The CCSID of a message description applies only to first- and second-level message text. 


Change the CCSID of a message description: When you take the option to change a message 
description from the Work with Message Descriptions display, all current values for the selected message 
description are retrieved and placed on the prompt display. The first- and second-level text are converted 
from the CCSID of the message file to the CCSID of the job before they are put on the prompt display. 


*JOB is displayed for the CCSID keyword and has two different meanings depending on what you do on 
the prompt display. If you change any part of the first- or second-level text, “JOB means that the text is 
converted from the CCSID of the job to the CCSID of the message file when you press the Enter key. If 
the text is unchanged, *JOB works like *SAME, and none of the following are changed: 


° The first-level message text 
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¢ The second-level message text 
* The CCSID of the message description 


Both the first- and second-level text of a message description must be in the same CCSID. If you change 
the CCSID of one level, the system automatically converts the other level to match. 


Example: Changing a message description 


The CCSID of message file MYMSGF is 65534. The CCSID of the job that is running WRKMSGD is 
00277. The CCSID of the message description is 00037. 


Select option 2 to change a message description. The text of the message description is converted from 
CCSID 00037 to 00277 before being placed on the prompt display. 


If only the first-level text is changed, the 00277-tagged text is stored in the message file. The CCSID of 
the message description is changed to 00277. The 00277-tagged second-level text is also stored in the 
message file to keep both the first- and second-level text in the same CCSID. 


Message queues: lf you set CCSID processing on, system-supplied display files and printer files that 
display or print messages convert the CCSID of the message queue to the CCSID of the job before 
displaying or printing the messages. To print and display the messages correctly, your job CCSID setting 
must be the same as the code page portion of your device CHRID setting. 


All messages that existed on a message queue that was created in a release prior to V3R1 are assigned 
CCSID 65535 on the first use of that message. 


Determining the CCSID of a message queue 


To determine the CCSID of a message queue, type: 
DSPMSG MSGQ(MYLIB/MYMSGQ) ASTLVL(*BASIC) 


where MYLIB is the library in which the message queue is stored and MYMSGQ is the name of the 
message queue. 


Then press F22 (Display list details). 


! application program interface 


(API) to determine the CCSID of a message queue. 


3 g, and the the object-level CCSID is always 65534. 
You cannot change nor aepley object- level CCSIDs for ob message queues, system reply lists, and the 
history log. 


Job message queues: The CCSID for all job message queues is 65534. You cannot change or display 
this value. A job message queue CCSID of 65534 requires fewer CCSID conversions. Fewer CCSID 
conversions of text result in better performance and improved data integrity. 


The CCSID of each message in the job log is used for CCSID processing. No conversion occurs when a 
message is sent to the job log. 


Note: Request messages are always tagged with a CCSID of 65535 and are never converted. 


If you set CCSID processing on, system-supplied display files and printer files that display or print job logs 
convert the CCSID of the messages to the CCSID of the job before displaying or printing the messages. 
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To print and display the messages correctly, your job CCSID setting must be the same as the code page 
portion of your device CHRID setting. Status messages that appear on line 24 of a display are converted 
to the CCSID of the device before they are shown. 


For more information about Job message queues and CCSID support, see [History log. 


System reply list: The system reply list has a CCSID of 65534. You cannot change or display this value. 
The only part of the system reply list that is affected by CCSID processing is the Compare data field. If the 
Compare data field references replacement data that is defined as *CCHAR, the data being compared 
must be in a common CCSID before the comparison is done. 


Any reply list entry that has compare data is tagged with the CCSID supplied on the ADDRPYLEI or 
CHGEPYiE commands. When the system reply list is used, the replacement data is converted to the 
CCSID of the compare data before the comparison is made and before the message is sent to the 
message queue. This ensures that the data is in a common CCSID before the comparison is done. 


Example: System reply list and converted-character compare data 


Enter the following Add Reply List Entry command: 


ADDRPYLE SEQNBR(101) MSGID(TSTO0010) CMPDTA(X'00017B') RPY(*DFT) + 
CCSID(37) 


X’7B’ is the number sign (#) on code page 00037. X’0001’ is the length required for variable-length fields. 
The compare data is not converted when added to the system reply list. It is tagged with CCSID 00037. 
Message TST0010 has one replacement data field that is defined as *CCHAR with (“VARY 2) for its 
length. Message queue MYMSGQ has a CCSID of 00278. 


Send message TST0010 in a job that has the system reply list turned on using the following Send 
Program Message command: 


SNDPGMMSG MSGID(TST0010) MSGF(MYLIB/MYMSGF) MSGTYPE(*INQ) + 
TOMSGQ(MYLIB/MYMSGQ) MSGDTA(X'00014A') CCSID(277) 


The replacement data is converted from CCSID 00277 to CCSID 00037 and then compared with the 
compare data. The conversion results in replacement data X’00017B’. A match is found and the default 
reply is sent when this message is added to the message queue. 


When the message is added to the message queue, the replacement data is converted from CCSID 
00277 to CCSID 00278. The message queue CCSID does not matter when trying to match the compare 
data. The replacement data is converted to X’000163’ when it is sent to the message queue and tagged 
00278. X’63’ is the code point for the number sign (#) in code page 00278. 


History log: The history log is a database file that is tagged with CCSID 65535. You cannot change the 
CCSID of the history log. No conversions occur when you do database retrievals from the history file. 


You can use CCSID processing when working with the history log. The CCSID of the replacement data or 
impromptu message text is added to the history log record. If the history log record is for a stored 
message, CCSID processing occurs only for the *CCHAR replacement data in that record. 


You can retrieve a message from the history log and convert it into a specific CCSID by doing the 
following: 


1. Obtain the input variables &MSGFL, &MSGF, &MSGID, &MSGDTA, and &MDTACCSID, from the 


history log record. (See the EL Programming 


PDF for the layout of the history log record.) 
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2. Enter the following Retrieve Message command: 


RTVMSG MSGF(&MSGFL/&MSGF) ; MSGID(&MSGID); MSGDTA(&MSGDTA); + 
MDTACCSID(&MDTACCSID) ; MSG(&MSG) ; 


If you set CCSID processing on, system-supplied display files and printer files that display or print history 
log records convert the CCSID of the messages to the CCSID of the job before displaying or printing the 
messages. To print and display the messages correctly, your job CCSID setting must be the same as the 
code page portion of your device CHRID setting. 


Set up CCSID support for message handling: The default setting of the CCSID for creating message 
queues and message files is 65535. Most message files delivered with the operating system have a 
CCSID of 65535. 


Most message descriptions in system-supplied message files are tagged with a CCSID that corresponds to 
the national language version with which they are shipped. 


Some message descriptions are not assigned a CCSID that corresponds to the national language version. 
These message descriptions are tagged 65535 and are not converted when used. 


Messages sent to a message queue that has a CCSID of 65535 are not converted when placed on the 
queue. Message descriptions added to a message file that has a CCSID of 65535 are not converted when 
placed in the file. These messages and message descriptions are tagged with a CCSID associated with 
their text or data. By tagging them with a CCSID associated with their text or data, they are given the 
correct CCSID if the object-level CCSID is changed to 65534. 


You can set CCSID support on for handling a specific message queue. For example, to set CCSID 
handling on for message queue MYMSGQ in library MYLIB, type: 


CHGMSGQ MSGQ(MYLIB/MYMSGQ) CCSID(65534) 


The Change Message Queue (CHGMSGQ) command also allows you to turn on CCSID support for more 
than one message queue at a time. 


You can set CCSID support on for handling a specific message file. For example, to set CCSID handling 
on for message file MYMSGF in library MYLIB, type: 


CHGMSGF MSGF(MYLIB/MYMSGF) CCSID(65534) 


The Change Message File command also allows you to turn on CCSID support for more 
than one message file at a time. 


CCSID support for message catalogs: The Message catalog CCSID (CLGCCSID) parameter allows 
you to specify the CCSID for storing data in a message catalog. The Source file CCSID (SRCCCSID) 
parameter allows you to specify the CCSID of a source file. Data from the source is converted to the 
CCSID of the message catalog if the CCSIDs for both are not the same. This is also the default action. 
The source can be in any CCSID that supports conversion to any other CCSID. 


The CCSID of the original message catalog is used to update the message catalog. It can be single or 
mixed and in extended binary-coded decimal interchange code (EBCDIC), American National Standard 
Code for Information Interchange (ASCII), or UCS-2. If the catalog is a QSYS source file member that 
does not exist, the CCSID of the existing file is used. The value that is specified on the CLGCCSID 
parameter is used if the CCSID of the file is 65535. 


Converted character replacement data type field: A replacement data type field supports CCSID 
processing. This replacement data type field is called a convertible character field ("CCHAR). A *CCHAR 
replacement data type field is a variable-length field. This field may increase or decrease in length when 
the field is converted. 
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Example: Add a message description with CCSID support 


The following example shows how to add the message description TSTOO06 to message file MYMSGF. 
The message description has 2 replacement data type fields. One field is a character field length 10. The 
other field is a convertible character field with varying length. Use the ADDMSGO command as follows: 


ADDMSGD MSG('This is *CHAR &1; This is *CCHAR &2;') MSGID(TSTOQ06) + 
MSGF(MYLIB/MYMSGF) FMT((*CHAR 10) (*CCHAR *VARY 2)) 


Retrieve messages: The Retrieve Message (RTvmMsa) command and retrieve message (QMHRTVM)) 
application program interface (API) have a CCSID-to-convert-to parameter. This parameter determines 
which CCSID the first- and second-level text is converted to before the text is returned to the user. The 
Retrieve Message command and the Retrieve Message API also have a replacement data CCSID 
parameter. This parameter communicates the CCSID of the replacement data to the system. The 
replacement data CCSID applies only to the parts of the replacement data that correspond to *CCHAR 
type data. No other replacement data is converted. 


The Retrieve Message command and Retrieve Message API convert the first- and second-level text from 
the CCSID of the message file to the CCSID on the CCSID-to convert-to parameter. Any replacement data 
that is *CCHAR data is converted from the replacement data CCSID to the CCSID-to-convert-to CCSID 
before being substituted into the correct replacement variables. The default for both parameters is *JOB, 
which means that the CCSID of the job is used. 


Retrieve Message command CCSID return fields 


Three CCSID return fields are supported by the Retrieve Message (RTVMSG) command: 


Example 1: Retrieving a message with CCSID support 


Message file MYMSGF has a CCSID of 65534. The CCSID of the message description is used to 
determine the CCSID from which to convert the message text. The CCSID of the message description 
(TST0003) is 00037. The first-level text is: 


'MSG #3 is &1;' 


&1 is defined as a *CCHAR variable field with a length of (*VARY 2). Enter the following RTVMSG 
(Retrieve Message) command: 


RTVMSG MSGF(MYMSGF) MSGID(TSTO0@03) MSG(&MSG); CCSID(277) + 
MDTACCSID(277) MSGDTA(X'0002D6D2' ) 


In the message data, the first 2 bytes are a length field with a value of 2. All *VARY fields begin with a 
length. The next 2 characters are the actual *CCHAR data with a value of X’D6D2’. X’D6D2’ represents 
the characters O and K on code page 00277. 


The first-level text is converted from CCSID 00037 to CCSID 00277. The replacement data is not 
converted before it is substituted for &1; because the replacement data CCSID matches the 
CCSID-to-convert-to parameter. As a result, the text returned in the variable &MSG is: 


'MSG #3 is OK.' 
The code point for the number sign (#) is the only change that occurred in the conversion. The number 


sign was converted from code point X’7B’ in code page 00037 to code point X’4A’ in code page 00277. All 
other code points in the text of the message matched in code page 00037 and code page 00277. 
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Note: If the CCSID of a message file is 65535, no conversion occurs, even though the message 
description CCSID is 00037. The CCSID of the message file always takes precedence over the message 
description CCSID. 


Example 2: Using return fields and converted character data 


Message description TST0005 has the following first-level text: 
This is *CHAR &1; This is *CCHAR &2; 


The message description is defined in message file MYMSGF, which has a CCSID of 65535. &1; is 
defined as a *CHAR field of length 1. &2; is defined as a *CCHAR field (*VARY 2) in length. The CCSID of 
the message description does not matter because the CCSID of the message file is not 65534. You enter 
the following RTVMSG command: 


RTVMSG MSGF(MYMSGF) MSGID(TSTO0@05) MSG(&MSG); CCSID(260) + 
MDTACCSID(37) MSGDTA(X'5A00015A') TXTCCSID(&TXTCCSID) ; 


Note: X’5A’ is the exclamation point (!) on code page 00037. 


These are the returned values from the command: 
* &MSG = ’This is *CHAR. This is *CCHAR !.’ 
The EBCDIC value of the *CHAR character is X’5A’. X’5A’ appears as an acute accent ( 


f 


) on code page 00260. The *CHAR data did not convert because only *CCHAR data supports CCSID 
processing. The ’&1’ stayed at X’5A’, while ’&2’ converted to X’4F’. X’4F’ is the exclamation point on 
code page 00260. 

* &TXTCCSID = 65535 
The [EXICCSIDI variable is set to 65535 because no conversion occurred. When no conversion occurs, 
the CCSID (if it is not 65534) of the message file is returned. 


CCSID of the text returned (TXTCCSID) return field: TXTCCSID is the CCSID of the text returned. If a 
conversion occurs and is successful, this value is always equal to the CCSID-to-convert-to value. If a 
conversion occurs and is not successful, this is the CCSID of the message file unless the CCSID of the 
message file is 65534. If the CCSID of the message file is 65534, the CCSID of the message description 
is returned. 


For example, message file MYMSGF has a CCSID of 65534. Your program needs to know the CCSID of 
message description TST0004. Specify the RTVMSG command as follows: 


RTVMSG MSGF(MYMSGF) MSGID(TST0Q04) CCSID(*HEX) TXTCCSID(&TXTCCSID) ; 


The CCSID of the message description is returned in the variable &TXTCCSID because you specified 
*HEX for the CCSID-to-convert-to parameter. “HEX means no conversion is to occur. If no conversion 
occurs and the message file CCSID is 65534, the message description CCSID is returned. 


You can also obtain the message description CCSID from the Work with Message Descriptions 
display. 

1. On the WRKMSGD display, select option 5 to display details. 

2. From the Select Message Details to Display menu, select option 5 to display message attributes. 


3. Page forward to the CCSID value. The message description CCSID is shown if the CCSID of the 
message file is 65534. If the CCSID of the message file is not 65534, the CCSID of the message file is 
shown. 


CCSID conversion status indicator (TXTCCSTA) return field: TXTCCSTA is the text CCSID conversion 
status indicator. Return codes help you determine what happened when the system converted your 
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message text to the CCSID-to-convert-to parameter. Positive return code numbers indicate that your 
conversion was successful. A successful return code does not always indicate that a conversion occurred. 
Negative return code numbers indicate that a conversion error occurred. 


The following list shows the available return codes: 


0 No conversion was necessary. The CCSID of the text matched the CCSID that you wanted the 
text converted to. 


1 No conversion occurred. Either the text was 65535 or the CCSID that you wanted the text 
converted to was 65535. 


No conversion occurred. You did not ask for any text to be returned. 


The text was converted to the CCSID specified. The conversion operation used the linguistic 
conversion tables. 


4 A conversion error occurred when the conversion operation used the linguistic conversion tables. 
The conversion operation then used a default conversion table. The default conversion completed 
without error. 


-1 An error occurred on both the linguistic and default conversions. The text was not converted. 


Replacement data CCSID conversion status indicator (MDTACCSTA) return field: MDTACCSTA is the 
replacement data CCSID conversion status indicator. Return codes help you determine what happened 
when the system converted your replacement data to the CCSID-to-convert-to parameter. 


Positive return code numbers indicate that your conversion was successful. A successful return code does 
not always indicate that a conversion occurred. Negative return code numbers indicate that a conversion 
error occurred. These return codes are similar to the IXICCSTA return codes. The return codes apply to 
the conversion that takes place on any *CCHAR replacement data being converted from the replacement 
data CCSID to the CCSID-to-convert-to value. 


The following list shows the available return codes: 


0 No conversion was necessary. The CCSID of the replacement data matched the CCSID that you 
wanted the text converted to. 


1 No conversion occurred. Either the replacement data was 65535 or the CCSID that you wanted 
the replacement data converted to was 65535. 


2 No conversion occurred. Either you did not ask for any replacement data to be returned or no 
*CCHAR replacement data fields were defined for the message description being retrieved. 


3 The replacement data was converted to the CCSID specified. The conversion operation used the 
linguistic conversion tables. 


4 A conversion error occurred when the conversion operation used the linguistic conversion tables. 
The conversion operation then used a default conversion table. The default conversion completed 
without error. 


-1 An error occurred on both the linguistic and default conversions. The replacement data was not 
converted. 


Receive messages: The Receive Message command, the Receive Nonprogram Message 
(QMHRCVM) API, and the Receive Program Message API have a CCSID-to-convert-to 
parameter. This parameter determines which CCSID the text or data is converted to before it is returned to 
the user. 
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The Receive Message command and APIs convert the text or data from the CCSID of the message queue 
or message file to the CCSID supplied on the CCSID-to-convert-to parameter. When replacement data is 
returned, only the *CCHAR data is converted from the CCSID of the message queue to the 
CCSID-to-convert-to value. 


If the CCSID of the message file or message queue is 65534, the text or data is converted from the 
CCSID of the message description or message to the CCSID supplied on the CCSID-to-convert-to 
parameter. 


The default for the CCSID-to-convert-to parameter is *JOB, which means that the CCSID of the job 
performing the receive operation is used. 


Receive Message command CCSID return fields 


Two CCSID return fields are supported by the Receive Message (RCVMSG) command: 


Receive Message API CCSID return fields 


The Receive Message (QMHRCVWM) API ane tne Heceve Program Message oo BA sate API rel 
the return fields defined in IX 

. The Asceive ese API and the Receive Program Message API 
also support two additional return fields. 


Example 1: Using the CCSID return fields 


Message description TST0005 has the following first-level text: 
This is &CHAR &1; This is *CCHAR &2; 


’&1’ is defined as a *CHAR field of length 1. ’&2’ is defined as a *CCHAR field (*VARY 2) in length. 


Message file MYMSGF has a CCSID of 65534. TST0005 is defined in message file MYMSGF. The 
message description CCSID is 65535. The CCSID of message queue MYMSGQ is 65534. 


You enter the following Send Program Message command: 


SNDPGMMSG MSGF(MYMSGF) MSGID(TST0005) CCSID(37) TOMSGQ(MYLIB/MYMSGQ) + 
MSGDTA(X'7B00017B') 


The message is not converted when it is sent to message queue MYMSGQ because the message queue 
CCSID is 65534. The message is tagged with CCSID 00037. 


You enter the following Receive Message command to receive the message just sent: 

RCVMSG MSGQ(MYLIB/MYMSGQ) MSG(&MSG); DTACCSID(&DTACCSID); + 

CCSID(277) MSGDTA(&MSGDTA) ; TXTCCSID(&TXTCCSID) ; 

Note: X’7B’ is the number sign (#) on code page 00037. 

Because the message description is tagged 65535, no conversion is performed when retrieving the 
message text of TSTO005. The replacement data is tagged 00037. The *CCHAR part of the message data 
is converted from CCSID 00037 to CCSID 00277 before being inserted for &2; *CHAR data is never 


converted. 


The following table shows the returned values after the Receive Message command runs: 
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Value Description 


&MSG = This is &CHAR . This is *CCHAR #. 


The *CHAR data was not converted when substituted for &1; The *CHAR data remains 
X’7B’. X’7B’ is the code point on code page 00277 for A ligature ( 


AE 
). 


The *CCHAR data was converted to X’4A’ before it was substituted for &2; X’4A’ is the 
code point on code page 00277 for the number sign (#). 


&TXTCCSID = 65535 The &TXTCCSID variable was set to 65535 because no conversion occurred. When no 
conversion occurs, the CCSID of the message description is returned if the CCSID of the 
message file is 65534. 


&DTACCSID = 00277 The &DTACCSID variable was set to 00277 because a conversion occurred. 


Example 2: Receiving a message with CCSID support 


Message file MYMSGF has a CCSID of 00037. Message queue MYMSGQ has a CCSID of 65534. The 
message being received has a message-level CCSID of 00277. CCSID 65534 uses the message-level 
CCSID when determining the CCSID the replacement data is to be converted from. 


The message being received is a stored message. The stored message has *CCHAR replacement data. 
The CCSID of the job is 00278. You enter the following Receive Message command: 


RCVMSG MSGQ(MYMSGQ) MSG(&MSG); MSGDTA(&MSGDTA) ; 


The first-level text of the stored message that you receive is converted from CCSID 00037 to CCSID 
00278. The replacement data of the message that you receive is converted from CCSID 00277 to CCSID 
00278. Then the replacement data is substituted into the first-level text and returned in &MSG. 


Both the first-level text and the replacement data of the message that you received are converted to the 
CCSID of the job because the CCSID of the job is the default for the CCSID-to-convert-to parameter. 


Two different conversions must occur because only the replacement data is stored in the message queue 
for stored messages. The text of a stored message must be retrieved from the message file. If the 
message contained other replacement data type fields that were not defined as *CCHAR, the 
non-*CCHAR data is not converted before being returned. 


Note: If the CCSID of the message queue is 00278, no conversion occurs on the replacement data before 
the message is returned, even though the message CCSID is 00277. Remember that the message queue 
CCSID takes precedence over the message-level CCSID. 


CCSID of the message text returned (TXTCCSID) return field: TXTCCSID is the CCSID of the message 
text returned. If a conversion occurs and the conversion is successful, this value is always the same as 
the CCSID-to-convert-to value. 


For impromptu text, if the conversion is not successful, TXTCCSID is the CCSID of the message queue, 
unless the message queue is 65534. If the message queue is 65534, TXTCCSID is the message-level 
CCSID of the impromptu text. 


For a stored message, if the conversion is not successful, TXTCCSID is the CCSID of the message file 


that contains the stored message, unless the message file is 65534. If the CCSID of the message file is 
65534, TXTCCSID is the CCSID of the message description for the stored message. 
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CCSID of the replacement data returned (DTACCSID) return field: DTACCSID is the CCSID of the 
replacement data returned. DTACCSID applies only to those parts of the replacement data defined as 
*CCHAR. If the message being received is an impromptu message, a value of 0 is returned. If a 
conversion occurs and the conversion is successful, this value is the same as the CCSID-to-convert-to 
value. 


If the conversion is not successful, the DTACCSID returned is the CCSID of the message queue, unless 
the CCSID of the message queue is 65534. If the CCSID of the message queue is 65534, the DTACCSID 
returned is the CCSID of the message. 


For example, a stored message TST0004 from message file MYMSGF is sent to message queue 
YOURMSGQ with replacement data. TST0004 is defined with *CCHAR replacement data. Message file 
MYMSGF is 65534. Message queue YOURMSGQ has a CCSID of 00037. 


Your program needs to know the CCSID of the message description and the replacement data sent to 
message queue YOURMSGQ. You enter the following Receive Message command: 


RCVMSG MSGQ(YOURMSQ) CCSID(*HEX) TXTCCSID(&TXTCCSID); DTACCSID(&DTACCSID) ; 


The message description CCSID is returned in the variable &TXTCCSID. The message description CCSID 
is returned because you specified *HEX for the CCSID-to-convert-to parameter. *HEX means that no 
conversion is to occur. If no conversion occurs and the message file CCSID tag is 65534, the CCSID of 
the message description is returned. 


The CCSID of message queue YOURMSGQ (00037) is returned in the variable 8DTACCSID. The 
message queue CCSID is returned because it is not 65534. 


You can also obtain the message-level CCSID using the Display Messages (DSPMSG) display. 
1. From the Display Messages display, press Help to display the Additional Message Information display. 
2. Then press F9 (Display Message Details). 


This displays the message-level CCSID when the CCSID of the message queue that this message is 
on is 65534. Otherwise, the CCSID of the message queue is displayed. 


Common questions about CCSID support for handling messages: Following are some common 
questions asked about CCSID support for handling messages. 


¢ When is the job default CCSID used for handling messages? 

* How can | determine if a message description is defined with *CCHAR? 
* Can the length of *CCHAR replacement data change? 

* Can | correct the CCSID of a message queue? 

* Can | correct the CCSID of a message file? 

* Can | correct the CCSID of a message? 

* Can | correct the CCSID of a message description? 


When is the job default CCSID used for handling messages?: A job default CCSID is always a CCSID 
with an encoding scheme of 1100 (single-byte EBCDIC) or 1301 (mixed-byte EBCDIC). The job default 
CCSID is used whenever both of the following are true: 


* Aconversion occurs from a CCSID with an encoding scheme other than 1100 or 1301 to a job CCSID. 
* The job CCSID is 65535. 


For example, ASCII data must be converted to a CCSID associated with the data when asked to convert 
to the CCSID of a job. The job default CCSID is used because it is never CCSID 65535. 


How can | determine if a message description is defined with *CCHAR?: You can use the Work with 
Message Description (WRKMSGD] command to determine if a message description is defined with 
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*CCHAR data. You can also use the Retrieve Message (QMHRTVM) API to return the replacement data 
format fields. For more information, see the System API topic. 


Can the length of *CCHAR replacement data change?: The length of *CCHAR replacement data can 
change. This is why *CCHAR replacement data is required to be a variable-length field. The length of the 
field will grow when converting from an SBCS CCSID to the UCS-2 Level-1 CCSID. The length of the field 
will shrink when converting from the UCS-2 Level-1 CCSID to an SBCS CCSID. 


For example, you define message description TST0011 as ’Printer &1; has error &2;’ in message file 
MYMSGF that has a CCSID of 65535. ’&1’ is defined as *CCHAR data (*VARY 2) in length. This is the 
name of the printer. &2; is defined as a *CHAR data with a length of 1. This is an error code. Enter the 
following Send Program Message command to send this message to message queue MYMSGQ: 


SNDPGMMSG MSGID(TSTOO11) MSGF(MYLIB/MYMSGF) TOMSGQ(MYLIB/MYMSGQ) + 
MSGDTA(X'000400500030F1') CCSID(61952) 


X’0004’ is the length of the variable *CCHAR data. X’00500030’ represents the characters PO in CCSID 
61952. If message queue MYMSGQ has a CCSID of 00037, the replacement data is converted to 
X’0002D7F0F1’ before it is sent to the message queue. If message queue MYMSGQ has a CCSID of 
65535, the data is not converted when it is sent to the message queue. 


Your application programs cannot hard-code the position of the return code in this example. When 
message queue MYMSGQ has a CCSID of 00037, the return code is 5 bytes into the message text. When 
message queue MYMSGQ has a CCSID of 65535, the return code is 7 bytes into the message text. 


Can | correct the CCSID of a message queue?: You may have a message queue that has a CCSID that 
does not match the CCSID of the messages on it. This usually results from sending messages with a 
message-level CCSID of 65535 to a message queue with a CCSID that is not 65534 or 65535. 


If all of the messages on a message queue have the same message-level CCSID and you know the 
message-level CCSID, you can enter the following command: 


CHGMSGQ MSGQ(MYMSGQ) CCSID(nnnnn) 
In this example, MYMSGQ is the name of the message queue and nnnnn is the message-level CCSID. 


If you do not know the CCSID of all the messages on the queue or if the messages on the queue have 
different CCSIDs, the message queue should have a CCSID of 65535 or 65534. You can change the 
message queue CCSID to 65535. Or, you can do the following: 

1. Delete all of the messages. 

2. Change the CCSID of the message queue to 65534. 

3. Send all of the messages again. 


Can | correct the CCSID of a message file?: You may have a message file that has a CCSID that does 
not match the CCSID of the message descriptions in it. This usually results from adding message 
descriptions with a message-level CCSID of 65535 to a message file with a CCSID that is not 65534 or 
65535. 


If all of the message descriptions in a message file have the same message-level CCSID, and you know 
the message-level CCSID, you can enter the following command: 


CHGMSGF MSGF(MYMSGF) CCSID(nnnnn) 
In this example, MYMSGF is the name of the message file and nnnnn is the message-level CCSID. 
If you do not know the CCSID of all the message descriptions in the file or if the message descriptions in 


the file have different CCSIDs, the message file should have a CCSID of 65535 or 65534. You can handle 
this situation in either of the following ways: 
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* Change the CCSID of the message file to 65535. 
¢ Follow these steps: 
1. Change the CCSID of the message file to 65534. 


2. Change the message-level CCSID of each message description to the correct value. See Can J 
for information on how to correct the CCSID of a 


message description. 


Can | correct the CCSID of a message?: You cannot correct the message-level CCSID of a message. 
You can change the message queue CCSID to match the message-level CCSID. You can also delete the 
message and send it again with the correct message-level CCSID. 


Can _| correct the CCSID of a message description?: You can use the Change Message Description 

command to change the CCSID of a message description. If you do not change the first- or 
second-level text at the same time that you change the message description CCSID, the text remains 
unchanged. Only the CCSID changes. 


For example, you can enter the following Change Message Description command to correct the CCSID of 
a message description without changing any of the first- or second-level message text: 


CHGMSGD MSGF(MYLIB/MYMSGQ) MSGID(TST0001) CCSID(37) 


Work with bidirectional data 


Arabic and Hebrew languages use an alphabet written and read from right to left. Numerics and Latin text 
imbedded in the right-to-left text are written and read from left to right; therefore, these languages are 
called bidirectional languages. 


Because bidirectional languages are written and read from right to left, you should avoid using the terms 
left and right. For example, right margin in Hebrew or Arabic documents would be the beginning of the line 
and not the end. Use the words start and end in place of the words right and /eft. 


Hebrew and Arabic have no case-sensitive characters. To avoid the incorrect presentation of characters, 
no case-sensitive checking or substitution should be performed. In addition, the Arabic language does not 
use abbreviations, therefore, you should use only complete words. 


The following links provide additional information on Bidirectional application design: 
* Bidirectional application support 


See Code globalized applications that use bidirectional data: guidelines for information about how you can 
design your applications to accommodate bidirectional data. 


Bidirectional application support 
OS/400 provides bidirectional application support in the following ways: 


¢ Workstation 
* Display file 
« UIM 


Workstation support 
Workstations that have the ability to display Arabic and Hebrew character sets also have the ability of 


right-to-left cursor movement. Right-to-left cursor movement on input fields can be achieved in one of the 
following ways: 
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Pressing a special function key available on Hebrew and Arabic keyboards called the reverse key. This 
is a toggle function that moves the cursor to the other side of the field, allows for cursor movement in 
the opposite direction, and also changes the language layer from Latin to Hebrew or Arabic and back 
again. 

Using the DDS cursor control codes for display files. When the CHECK keyword is used with a 
cursor-controlled code, it specifies that the cursor is to move from right to left. The valid cursor control 
codes are: 


— CHECK (RL): Moves the cursor from right to left in specified nonnumeric input fields or in all 
nonnumeric input fields on the display. 


— CHECK (RLTB): Moves the cursor from right to left between fields. 


When using these parameters, remember the following: 


— Modulus check digit verification is supported, but the check digit is the byte to the extreme right of 
the field. 


— A field for which right-to-left cursor movement is specified can occupy more than one line on the 
display. However, the cursor still moves from the top of the display to the bottom. 


— You cannot use right-to-left cursor movement with user-defined data streams. 


Note: If no cursor positioning is specified in the display file or by the program, the cursor is placed in 
the input-capable field to the extreme left of the top line. 


See the IDDS Reference: Concepts topic for more information. 


Display file support 


The server does not check to make sure that all display files opened to the display station are capable of 
right-to-left cursor movement. Therefore, it is the responsibility of application programmers to ensure that 
the proper display files are used. 


User interface manager support 


The user interface manager gives the following bidirectional support for creating online information and 
panels: 


BIDI= NONE | RTL| LTR 
This attribute controls the directional orientation of the panels in the panel group. 


RTL indicates that the panel in the panel group is bidirectional and should be displayed with a 
right-to-left orientation. 


LTR indicates that the panel in the panel group is bidirectional and should be displayed with a 
left-to-right orientation. 


:-RT and :ERT 


Reverse-direction-text tags indicate that the enclosed text has an orientation that is opposite to the 
orientation of the panel group. 


For a list of UIM tags, see the [App 


eS 


PDF. 


Checklist: Bidirectional support guidelines 
When creating an application with bidirectional support, you must follow some guidelines. Some of these 
guidelines are listed in the following table: 
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Complies 


Not applicable 


Rule 


Software design must allow for bidirectional data to be passed to 
applications in the same order that a speaker of the language would 
spell it out. 


The product design must allow for the implementation of the correct 
handling of bidirectional keyboard and presentation functions. 


Designing of a function that implies logical movement of cursor or 
characters must permit mirroring of that function. 


Keys or operations labeled with directional icons or symbols must 
perform according to the icon or symbol. 


Keyboard nomenclature for mirrored functions must be independent 
of the direction of data or text entry. 


Display functions must not assume a left-to-right orientation. 


Field attributes must contain room for directional information. 


Indicator location must be reserved for the current direction of the 
cursor (direction of input). 


The design must allow for independent handling of graphic and text 
orientation. 


Provision must be made to allow shape determination to be 
performed. 


The deshaping must be definable. 


Provision must be made to allow the selection of the appropriate 
presentation shape for the numerals. 


Characters must be allowed to touch each other on printers and 
displays. 


Indicator locations should be reserved for screen and field 
orientation, current level of nesting, status of push (nesting 
mechanism), and status of symmetric swapping. 


The design should provide for a method to indicate to the user the 
nesting structure of a string. 


A system-wide method of deshaping Arabic characters or character 
strings should be provided. 


An indicator location should be provided for the status of shape 
determination. 


A method should be provided so that proportional spacing can be 
provided. 


A method should be provided to allow alignment of the baseline of 
Arabic and Latin characters (including Hindi and Arabic shapes for 
numerals). 


Work with DBCS data 


The following topics describe how you handle DBCS data in applications that use DBCS-capable device 


files: 


* Checklist: DBCS application design 


- [Develop applications that process DBCS data 


¢ DBCS code schemes 


¢ DBCS font tables 
DBCS font files 
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¢ DBCS sort tables 
¢ DBCS field definition 


A DBCS file is a file that contains double-byte data or is used to process double-byte data. Other files are 
called alphanumeric files. You can view DBCS files on display, printer, tape, diskette, and ICF devices. 


You use data description pee ae to describe DBCS-capable device files. For information 
about using DDS, see the topic. 


You should indicate that a file is DBCS in one or more of the following situations: 
* The file receives input, or displays or prints output, which has double-byte characters. 
* The file contains double-byte literals. 


¢ The file has double-byte literals in the DDS that are used in the file at processing time (such as 
constant fields and error messages). 


¢ The DDS of the file includes DBCS keywords. 
* The file stores double-byte data (database files). 


DBCS strings in a mixed data stream 


Usually, both single-byte characters and double-byte characters are used in a DBCS environment. For 
example, an accounting firm in Japan uses both English and Japanese for the spreadsheet. If both English 
and Japanese are being encoded as mixed SBCS and DBCS, the product must be able to understand a 
mixed character set that contains both single-byte coded characters and double-byte coded characters. 


In IBM systems that use EBCDIC, a DBCS string is bracketed in a mixed data stream by a shift-out (SO) 
control character and a shift-in (SI) control character. 


The following example shows the coding for a mixed string: 
sss (SO) D1D2D (SI) ssss 


The following example shows the coding for a mixed hexadecimal string: 
818283 OE 41424143 OF 818283 


Supported code ranges 


OS/400 supports Japanese, Korean, Simplified Chinese, and Traditional Chinese character-set code 
ranges. 


Using the family of products, the servers also provide support for these non-IBM personal 
computer DBCS code pages: 


¢ Republic of Korea National Standard graphic character set (KS) 
¢ Taiwan Industry Standard graphic character set (Big5) 
¢ The People’s Republic of China National Standard graphic character set (GB) 


Checklist: DBCS application design 

When creating an application with double-byte coded character set support, you must follow some 
guidelines. A complete list of these guidelines, as well as a full description of each guideline, is included in 
Volume 1 Designing Enabled Products, Rules and Guidelines (SEO9-8001). For your convenience, a 
subset of these guidelines is provided in the following table. 
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Complies Not applicable Rule 


Double-byte coded character set code points in the graphic 
character range must be used only for graphic characters and must 
not be used for control purposes. 


Single-byte meaning must not be drawn from either byte of 
double-byte coded data. 


Double-byte coded character set character generators must be 
capable of producing user-accessible graphic characters. 


The ability to switch between single-byte coded character set and 
double-byte coded character set and the coexistence of single-byte 
coded character set and double-byte coded character set in the 
same session must be possible. 


User-interface text modules for double-byte coded character set 
systems must be loaded separately from the running code. 


Develop applications that process DBCS data 
Design your application programs for processing double-byte data in the same way you [design application 
a for processing alphanumeric data, with the following additional considerations: 


Make sure that the double-byte data is always processed in a double-byte unit and does not split a 
double-byte character. 


Identify double-byte data used in the database files. 
Design display and printer formats that can be used with double-byte data. 


If needed, provide DBCS conversion as a means of entering double-byte data for interactive 
applications. Use the DDS keyword for DBCS conversion (IGCCNV) to specify DBCS conversion in 
display files. Because DBCS workstations provide a variety of double-byte data entry methods, you are 
not required to use the OS/400 DBCS conversion function to enter double-byte data. 


Create double-byte messages to be used by the program. 
Specify extended character processing so that the system prints and displays all double-byte data. 


Determine whether additional double-byte characters need to be defined. User-defined characters can 
be defined and maintained using the character generator utility (CGU). Information on CGU can be 
found in the ADTS/400: Character Generator Utility, SCO9-1769-00 book. 


The following topics provide more detailed information about how you can use DBCS data in your 
applications: 


Use of double-byte data 

DBCS coding considerations 

Process double-byte characters 

Display support 

Make DBCS-capable printer files 

Copy spooled and nonspooled DBCS files 
Change alphanumeric to DBCS programs 
DBCS text and CL commands 

DBCS conversion 

SQL and DBCS 


Use of double-byte data: You can use double-byte data in the following ways: 


As data in files: 
— Data in database files. 
— Data entered in input-capable and data displayed in output-capable fields of display files. 
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— Data printed in output-capable fields in printer files. 
— Data used as literals in display files and printer files. 
* As the text of messages. 
* As the text of object descriptions. 
* As literals and constants, and as data to be processed by high-level language programs. 


Double-byte data can be displayed only at DBCS displays and printed only on DBCS printers. Double-byte 
data can be written onto diskette, tape, disk, and optical storage. 


Where you cannot use double-byte data: 


You cannot use double-byte data in the following ways: 

¢ As OS/400 object names. 

¢ As command names or variable names in control language (CL) and other high-level languages. 
* As displayed or printed output on alphanumeric workstations. 


Double-byte character size: 
When displayed or printed, double-byte characters usually are twice as wide as single-byte characters. 


Consider the width of double-byte characters when you calculate the length of a double-byte data field 
because field lengths are usually identified as the number of single-byte character positions used. For 
more information on calculating the length of fields containing double-byte data, see the 

topic. 


DBCS coding considerations: |f you plan to have your application used in a DBCS environment, you 
should ensure that it is DBCS-enabled. Following are some suggestions to consider when developing your 
general product design. 


e Reserve more expansion space for DBCS textual data translation than you reserve for SBCS textual 
data translation. (It is possible, however, that the number of bytes used may be reduced when a SBCS 
sentence is being translated into DBCS.) 


¢ Ensure programs can understand shift-out and shift-in delimiters. Otherwise, EBCDIC mixed-byte 
character strings cannot be handled. 


* Do not enable short responses for DBCS. For short responses, it is difficult to shift in and out of DBCS. 
The yes and no are examples of short responses. 


¢ Remember to use the graphic data type G where appropriate. 
¢ Remember that the 5494 remote controller supports the graphic data type. 


* Be careful when converting mixed data between DBCS-host code and DBCS-PC code, because the 
transition may change the data length. Losing and gaining SO and SI character pairs can upset 
field-length calculations. 


* Make sure the double-byte data is always processed in a double-byte unit. Do not split a double-byte 
character. 


* Design the display as well as the print format to avoid the problem of truncation of a double-byte 
character into two single-byte units. 


See the following for additional DBCS coding considerations: 
* Creating physical files 

* Target physical files 

¢ Using CCSIDs 

¢ Using DDS keywords 

¢ DBCS file data types 
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* The Katakana code page (00290) 
* UCS-2 level-1 DBCS display support 


Creating physical files: When creating a physical file, display file, and printer file for a DBCS environment, 
consider the IGCDTA parameter present in the following commands: 
* Create Physical File (CRTPF} command 


lf DBCS fields are described in DDS, the system treats the file as a DBCS file. Otherwise, specify *YES 
for the parameter of the CRTPF command so that the file can contain double-byte character set data. 
However, the system ignores the IGCDTA parameter value when a value for the RCDLEN parameter is 
not specified. 


* Create Display File (CRTDSPP)| and Create Printer File (CRIPRTEY commands 
Specify *YES for the parameter when using the CRTDSPF or CRTPRTF commands to create the 


externally described files. Then DBCS attributes, in addition to those defined in the DDS, can be 
specified. 


Target physical files: When the CPYSPLE] DSPSPLE] or WRKSPLH commands with OUTPUT(*OUTFILE) 


option are used under the DBCS version of the OS/400 program, the target physical file must be 
DBCS-enabled. 


Note: The primary language of the system must support the double-byte character set to allow 
DBCS-enabled applications. 


Use the QIGC system value to check if a DBCS version of the system is installed. Because it is set by the 
system, it cannot be changed. This system value can be referred to in an application program. QIGC can 
be: 

* 0 (DBCS version is not installed) 


* 1 (DBCS version is installed) 


A DBCS system allows for concurrent use of SBCS and DBCS data. When the QIGC system value is 1, 
you should not assume all jobs are DBCS. 


Using CCS/IDs: Use DBCS CCSIDs for DBCS languages. When designing an application to be used in 

the DBCS environment, consider the following CCSID information: 

* If the QIGC system value is set on, system value QCCSID must have the value of a mixed CCSID. 

¢ If the DBCS and SBCS language users are sharing the same system, they may want to store their data 
in different databases. It is possible to create DBCS-capable and SBCS-capable physical files in the 
same system. The CCSID parameter on the command or the CCSID keyword on the physical 
file DDS definition can be used to specify the CCSID value that the data is stored in. 

* If aCCSID was not explicitly assigned through DDS at file creation time, the database physical file 
character J (DBCS-only), E (DBCS-either), O (DBCS-open) or G (DBCS-graphic) fields are implicitly 
assigned a CCSID value. 


Using DDS keywords: Consider the following DDS keywords so that you can specify alternative ways to 
enter data through display files, change input- and output-capable alphanumeric data fields to DBCS data 
fields, or to specify the special features of the DBCS printer output: 
¢ CHRSIZ (Character Size) 
This printer file keyword can expand the printer characters to twice the normal size (width and height). 
This keyword can is valid only for IPDS printers and for printer files with a device type of *IPDS or 
“AFPDS specified. 
* CONCAT (Concatenate) 
This keyword can be used only on logical files. This keyword does not support concatenation of a 
character field and a data type O field. 


* DFLIN (Define Line) 
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The printer file Keyword draws horizontal and vertical lines. 
¢ IGCALTTYP (DBCS Alternative Data Type) 


This display and printer keyword is used to change input- and output-capable character fields to DBCS 
fields with data type O. 


* IGCANKCNV (Alphanumeric-to-DBCS Conversion) 


This printer file keyword converts alphanumeric SBCS characters to equivalent DBCS characters. 
Printed SBCS alphanumeric characters have the same appearance as printed DBCS characters. The 
printed DBCS characters, however, are twice as wide as the equivalent SBCS alphanumeric characters. 


¢ IGCCDEFNT (DBCS Coded Font) 

This printer file keyword specifies the DBCS coded font for printing a named or constant field (or fields). 
* IGCCNV (DBCS Conversion) 

This is a display file keyword that enables DBCS conversion. 
* IGCCHRRTT (DBCS Character Rotation). 


This printer file keyword rotates each DBCS character 90 degrees counterclockwise before printing. By 
rotating characters, the system prints them in reading sequence. This keyword should be used only for 
printer files to be printed with 5553 printers or IPDS AFP(*YES) printers. 


For more information on the DDS keywords for DBCS, see the DDS Reference: Concepts topic. 


DBCS file data types: The data type of a field in a physical file may be changed when it is being referred 
to in a logical file, as summarized in the following table: 


Physical File Data Types Logical File Data Types 
J J, O, E, H, G 
O O,H 

E E,O,H 

A A, O, E, H 
H 

G 


J, O, E, A, H 


G,O,J,E 


The Katakana code page (00290): The Katakana code page (code page 00290) of Japan supports 
uppercase English and single-byte Katakana (phonetics) characters. The lowercase English characters are 
located at code points different from other code pages and the hardware may not be able to display 
English uppercase, lowercase, and Katakana characters concurrently. Therefore, special considerations 
should be taken if the application is going to support this code page: 


¢ Avoid using the lowercase alphabet for syntactic characters. 
¢ Avoid using the SBCS lowercase alphabet with Japanese DBCS messages. 


UCS-2 level-1 support and IBM DBCS displays: OS/400 supports ISO/IEC 10646 Universal Coded 
Character Set 2, Level 1 (UCS-2, Level-1). IBM DBCS-capable display stations, however, do not support 
UCS-2 Level-1 data. If you are designing an application to handle UCS-2 Level-1 data for display on an 
IBM DBCS-capable display, you must convert the data to a mixed-byte CCSID before sending the data to 
the display station. 


Process double-byte characters: Due to the large number of double-byte characters, the system needs 
more information to identify each double-byte character than is needed to identify each alphanumeric 
character. 

There are two types of double-byte characters: basic and extended. These characters are usually 
processed by the device on which the characters are displayed or printed. 
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Basic double-byte characters: 


Basic characters are frequently used double-byte characters that reside in the hardware of a 
DBCS-capable device. The number of double-byte characters stored in the device varies with the 
language supported and the storage size of the device. A DBCS-capable device can display or print basic 
characters without using the extended character processing function of the operating system. 


Double-byte extended characters: 


When processing extended characters, the device requires the assistance of the system. The system must 
tell the device what the character looks like before the device can display or print the character. Extended 
characters are stored in a DBCS font table, not in the DBCS-capable device. When displaying or printing 
extended characters, the device receives them from the DBCS font table under control of the operating 
system. 


Extended character processing is a function of the operating system that is required to make characters 
stored in a DBCS font table available to a DBCS-capable device. 


To request extended character processing, specify the double-byte extended character parameter, 
IGCEXNCHR(*YES), on the file creation command when you create a display (CRIDSPE] or create a 
printer file (care command that processes double-byte data. Because IGCEXNCHR(*YES) is the 
default value, the system automatically processes extended characters unless you instruct it otherwise. 
You can change this file attribute by using the change file or (CHGPATE) command. You can 
override the file attribute with the override display file or override printerfile 

command. For example, to override the display file DBCSDSPF so that extended characters are 
processed, enter: 


OVRDSPF DSPF(DBCSDSPF) IGCEXNCHR(*YES) 


Notes: 
1. The system ignores the IGCEXNCHR parameter when processing alphanumeric files. 


2. When you use the Japanese 5583 Printer to print extended characters, you must use the Kanji print 
function of the Advanced DBCS Printer Support licensed program. 


What happens when extended characters are not processed: 


When extended characters are not processed, the following happens: 
* Basic double-byte characters are displayed and printed. 


* On displays, the system displays the undefined character where it would otherwise display extended 
characters. 


* On printed output, the system prints the undefined character where it would otherwise print extended 
characters. 


¢ The extended characters, though not displayed or printed, are stored correctly in the system. 
Display support: The following provides useful information about displaying double-byte characters. 
Inserting shift-control characters: 

The system inserts shift-control characters into DBCS-only fields automatically. 


To insert shift-control characters into open fields or either fields, do the following: 
1. Position the cursor in the field in which you want to insert double-byte data. 
2. Press the Insert Shift Control Character key (according to your DBCS display user’s guide). 
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The system inserts a pair of shift-control characters at the same time. The system leaves the cursor under 
the shift-in character and puts the keyboard in insert mode. Insert double-byte characters between the 
shift-control characters. 


To find out if a field already has the shift-control characters, press the Display Shift Control Character key. 


DBCS-graphic fields store double-byte characters without requiring the use of shift control characters. Shift 
control characters should not be inserted in graphic fields. 


Number of displayed extended characters: 


The system can display up to 512 different extended characters on a Japanese display at one time. 
Additional extended characters are displayed as undefined characters. However, the additional extended 
characters are stored correctly in the system. 


Number of input fields on a display: 


The use of DBCS input fields affects the total number of input fields allowed on a display. For a local 5250 
display, you can specify as many as 256 input fields. However, each three instances of a DBCS field 
reduces the maximum number of fields by one. For example, if there are 9 DBCS fields on a display, then 
the maximum is 256 - (9/3) = 253 input fields. 


Effects of displaying double-byte data at alphanumeric workstations: 


Alphanumeric displays cannot display double-byte data correctly. If you try to display double-byte data at 
an alphanumeric display, the following happens: 


* The system sends an inquiry message to that display, asking whether you want to continue using the 
program with double-byte data or to cancel it. 


* If you continue using the program, the system ignores the shift-control characters and interprets the 
double-byte characters as though they were single-byte characters. Displayed double-byte data does 
not make sense. 


Make printer files DBCS capable: \n many cases, printer files are used by the server to produce data 
that will eventually be printed or displayed. In these cases, the data is first placed into a spooled file using 
one of the IBM-supplied printer files. The data is then taken from the spooled file and is displayed or 
printed based on the request of the user. 


When the data involved contains double-byte characters, the printer file that is used to place the data into 
the spooled file must be capable of processing double-byte data. A printer file is capable of processing 
double-byte data when *YES is specified on the IGCDTA parameter for the file. In most cases, the system 
recognizes the occurrence of double-byte data and takes appropriate measures to ensure the printer file 
that is used is capable of processing double-byte data. 


In some cases, however, the system cannot recognize the occurrence of double-byte data and may 
attempt to use a printer file that is not capable of processing double-byte data. If this occurs, the output at 
the display or printer may not be readable. This can happen when object descriptions containing 
double-byte characters are to be displayed or printed on an alphanumeric device. 


To ensure that you receive correct results when you display or print double-byte characters, some 
recommendations should be followed. Action is required on your part if you have a single-byte national 
language installed as a secondary language. Printer files that are received as part of the DBCS version of 
a product are always capable of processing DBCS data. 


You should complete the following recommended actions after the product or feature has been installed: 
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1. If all printers and display devices attached to your system are DBCS-capable, you can enable all 
printer files for double-byte data. For IBM-supplied printer files that are received as part of a 
single-byte secondary language feature, you can enable all printer files by issuing the following 
command: 


CHGPRTF FILE(*ALL/*ALL) IGCDTA(*YES) 


After this command has been completed, all printer files in all libraries will be enabled for double-byte 
data. The change will be permanent. 


2. If all printer and display devices attached to your system are not DBCS-capable, it is recommended 
that you do not enable all IBM-supplied printer files. 


Instead, use the library search capabilities of the system to control which printer files will be used for 
any particular job. When the potential exists that double-byte data will be encountered, the library list 
for the job should be such that the printer files that are DBCS-enabled will be found first in the library 
list. Conversely, if only single-byte data is expected to be encountered, the library list should be set up 
so the printer files that are not enabled for DBCS will be found first. In this way, the printer file 
capabilities will match the type of data that will be processed. The decision as to what type of printer 
file to use is made on the basis of what type of data will be processed. The device that will be used to 
actually display or print the data may also influence this decision. 


In some cases it may be desirable to make the printer file only temporarily DBCS-capable instead of 
making a permanent change. For a specific job, you can make this temporary change by using the 
OVEPRTE command. 


To temporarily enable a specific printer file, you can use the following command: 
OVRPRTF FILE(filename) IGCDTA(*YES) 


Where filename is the name of the printer file you want to enable. 

Copy spooled and nonspooled DBCS files: You can copy both spooled and nonspooled DBCS files. 
Copying spooled files 

Copy spooled files that have double-byte data by using the Copy Spooled File (CPYSPLF) command. 
However, the database file to which the file is being copied must have been created with the 
IGCDTA(*YES) value specified. 

When copying spooled files to a database file that contains double-byte data, an extra column is reserved 
for the shift-out character. This shift-out character is placed between the control information for the record 


and the user data. The following table shows the shift-out character column number, based on the value 
specified for the Control character (CTLCHAR) keyword: 


CTLCHAR value Column for shift-out character 
*NONE 1 

*FCFC 2 

*PRTCTL 5 

*S36FMT 10 


Copying nonspooled DBCS files 
You can use the Copy File (CPYE) command to copy double-byte data from one file to another. 


When copying data from a double-byte database file to an alphanumeric database file, specify one of the 
following on the CPYF command: 


OS/400 globalization 115 


* lf both files are source files or if both files are database files, you can specify either the FMTOPT(*MAP) 
parameter or the FMTOPT(*NOCHk) parameter. 


¢ If one file is a source file and the other file is a database file, specify the FMT(*CVTSRC) parameter. 


When you copy DBCS files to alphanumeric files, the system sends you an informational message 
describing the difference in file types. 


Either the FMTOPT(*MAP) or FMTOPT(*NOCHk) option of the copy file function must be specified for 
copies from a physical or logical file to a physical file when there are fields with the same name in the 
from-file and to-file, but the data type for fields is as shown in the following table: 


From-file field data type 


To-file field data type 


A (character) 


O (DBCS-open) 


J (DBCS-only) 
J (DBCS-only) 


O (DBCS-open) 


E (DBCS-either) 


E (DBCS-either) J (DBCS-only) 

J (DBCS-only) G (DBCS-graphic) 
O (DBCS-open) G (DBCS-graphic) 
E (DBCS-either) G (DBCS-graphic) 
G (DBCS-graphic) J (DBCS-only) 

G (DBCS-graphic) O (DBCS-open) 
G (DBCS-graphic) E (DBCS-either) 


When you use FMTOPT(*MAP) on the CPYF command to copy data to a DBCS-only field or 
DBCS-graphic field, the corresponding field in the from-file must not be: 


¢ Less than a 2-byte character field 
¢ An odd-byte-length character field 
* An odd-byte-length DBCS-open field 


If you attempt to copy with one of these specified in the from-field, an error message is sent. 


When you copy double-byte data from one database file to another with the FMTOPT(*MAP) parameter 
specified, double-byte data will be copied correctly. The system will perform correct padding and truncation 
of double-byte data to ensure data integrity. 


When using the CPYF command with FMTOPT(*MAP) to copy a DBCS-open field to a graphic field, a 
conversion error occurs if the DBCS-open field contains any SBCS data (including blanks). 


Change alphanumeric programs to DBCS programs: lf an alphanumeric application program uses 
externally described files, you can change that application program to a DBCS application program by 
changing the externally described files. To convert an application program, do the following: 


1. Create a duplicate copy of the source statements for the alphanumeric file that you want to change. 
2. Change alphanumeric constants and literals to double-byte constants and literals. 


3. Change fields in the file to the open (O) data type or specify the Alternative Data Type [IGCALTTYP) 
DDS keyword so that you can enter both double-byte and alphanumeric data in these fields. You may 
want to change the length of the fields as the double-byte data takes more space. 


4. Store the converted file in a separate library. Give the file the same name as its alphanumeric version. 


5. When you want to use the changed file in a job, change the library list, using the Change Library List 
(CHGLIBL] command, for the job in which the file will be used. The library in which the DBCS display 
file is stored is then checked before the library in which the alphanumeric version of the file is stored. 
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Enter DBCS text in CL commands: You can use double-byte character data anywhere in a CL 
command that descriptive text can be used. 


Enter double-byte character text as follows: 

1. Begin the double-byte character text with an apostrophe (’). 
Enter a shift-out character. 

Enter the double-byte character text. 

Enter a shift-in character. 

End the double-byte character text with an apostrophe (’). 


aaron 


For example, to enter the double-byte character literal ABC, enter the following, where SO represents the 
shift-out character and SI represents the shift-in character: 


"SOABCSI' 


Limit the length of a double-byte character text description of an object to 14 double-byte characters, plus 
the shift control characters, to make sure that the description is properly displayed and printed. 


DBCS conversion: When you use DBCS displays to enter double-byte data, you may use the various 
data entry methods supported on the display, or you may choose to use the server DBCS conversion 
support. DBCS conversion lets you enter an alphanumeric entry or DBCS code and convert the entry or 
code to its related DBCS word. DBCS conversion is intended for Japanese character sets and its use is 
limited for application to other double-byte character sets. 


Specifically, DBCS conversion lets you convert the following: 

* A string of alphanumeric characters to a DBCS word 

* English alphanumeric characters to double-byte alphanumeric characters 
¢ Alphanumeric Katakana to double-byte Hiragana and Katakana letters 

* A DBCS code to its corresponding double-byte character 

¢ A DBCS number to its corresponding double-byte character 


The following links provide additional information about DBCS conversion: 
* Conversion dictionaries 

¢ Work with conversion dictionaries 

¢ Japanese DBCS conversion 


Conversion dictionaries: The DBCS conversion dictionary is a collection of alphanumeric entries and their 
related DBCS words. The system refers to the dictionary when performing DBCS conversion. 


All DBCS conversion dictionaries have an object type of *~IGCDCT. A system-supplied and a user-created 
dictionary are used with DBCS conversion. 


User-created dictionary: 


A user-created dictionary contains any alphanumeric entries and related DBCS words that you choose to 
include. You might create a user dictionary to contain words unique to your business or words that you use 
regularly but that are not included in the system-supplied dictionary. 


You can create one or more DBCS conversion dictionaries with any name and store them in any library. 
When performing DBCS conversion, however, the system only refers to the first user dictionary named 
QUSRIGCDCT in the user’s library list, no matter how many dictionaries you have or what they are 
named. Make sure that the library list is properly specified so that the system checks the correct dictionary. 
During DBCS conversion, the system checks QUSRIGCDCT before checking QSYSIGCDCT. 
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DBCS conversion dictionary commands: 


You can use the following commands to perform object management functions with the DBCS conversion 
dictionary. Specify the OBJTYPE(*IGCDCT) parameter when entering these commands: 


» (CHGOBIOWNI Change the owner of a DBCS conversion dictionary 
- [CHKOBL Check a DBCS conversion dictionary 

- [cRTDUPOR Create a duplicate object of the dictionary 
° Dump a DBCS conversion dictionary 

7 Dump the system-supplied dictionary 

- [DSPOBJAUT Display a user’s authority to the dictionary 
- GRIOBJAUT] Grant authority to use the dictionary 

‘ Move the dictionary to another library 

‘ Rename the dictionary 

- [RSTOB,H Restore the dictionary 

- [RBVKOBJAUT] Revoke authority to use the dictionary 

- [SAVOB] Save the dictionary 

- [SAVCHGOBJ Save a changed dictionary 


The system saves or restores DBCS conversion dictionaries when you use these commands: 

- [RSTLIB] Restore a library in which the dictionary is stored 

- [SAVLIBI Save a library in which the dictionary is stored 

- [Savsysj Save QSYSIGCDCT, the system DBCS conversion dictionary, when saving the system 


You can use the following commands to create, edit, display, and delete a dictionary: 
* ICRTIGCDCT] Create DBCS Conversion Dictionary 

- IEDTIGCDCT Edit DBCS Conversion Dictionary 

> DsPiGcpcT] Display DBCS Conversion Dictionary 

- [DLTIGCDCT] Delete DBCS Conversion Dictionary 


Work with conversion dictionaries: The following topics describe how you create, edit, display, print, and 
delete conversion dictionaries. 


Create a DBCS conversion dictionary 


To create a DBCS conversion dictionary, do the following: 
1. Use the Create DBCS Conversion Dictionary (CRTIGCDCT)] command. 


2. Name the dictionary, QUSRIGCDCT, so it can be used during DBCS conversion. The system uses the 
dictionary if it is the first user-created dictionary found when searching a user's library list. 


You might call the dictionary by another name while it is being created to prevent application programs 
from using it for conversion. Later, change the dictionary name using the Rename Object (RNMOB4J) 
command. 


3. Use the EDTIGCDCT command to put entries and related words into the dictionary after creating it. 
Edit a DBCS conversion dictionary 
Use the Edit DBCS conversion dictionary (EDTIGCDCT) command to edit the DBCS conversion dictionary. 


Use editing to add user-defined characters to the dictionary, so that users can enter characters using 
DBCS conversion, and rearrange terms in a DBCS conversion dictionary to suit individual needs. 
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The display needed for use while editing the DBCS conversion dictionary depends on the value that you 
entered for the ENTRY parameter on the EDTIGCDCT command: 


* If you specified a specific string with the ENTRY parameter or if you want to display double-byte 
characters, you must use a DBCS display. 


¢ If you did not specify a specific string with the ENTRY parameter, or if you do not want to display 
double-byte characters, use either a DBCS display, or a 24-row by 80-column alphanumeric display. 


You may perform the following editing operations on a user-created DBCS conversion dictionary: 


* Add entries to the dictionary (including adding the first entries to the dictionary after it is created). The 
dictionary can contain as many as 99,999 entries. 


* Delete entries from the dictionary. 
¢ Change entries in the dictionary, such as replacing the DBCS words related to an alphanumeric entry. 


¢ Move the DBCS words related to an alphanumeric entry to rearrange the order in which they appear 
during DBCS conversion. 


The only editing function that you can perform with QSYSIGCDCT, the system-supplied dictionary, is to 
move DBCS words related to an alphanumeric entry. Move words in order to rearrange the order in which 
they appear during DBCS conversion. 


Display and print the DBCS conversion dictionary 


Use the Display DBCS Conversion Dictionary (DSPIGCDCT) command to display and print the DBCS 
conversion dictionary. You can display or print the entire dictionary or just a certain part of it, depending on 
the value you specify for the ENTRY parameter. 


For example, to print the entry ABC from the dictionary QUSRIGCDCT and its related words, enter: 


DSPIGCDCT IGCDCT(DBCSLIB/QUSRIGCDCT) + 
ENTRY (ABC) OUTPUT (*PRINT) 


To display all of the entries from the system-supplied dictionary QSYSIGCDCT and their related words, 
enter: 


DSPIGCDCT IGCDCT(QSYS/QSYSIGCDCT) 
Delete a DBCS conversion dictionary 


Use the Delete DBCS Conversion Dictionary (DLTIGCDCT)] command to delete a DBCS conversion 
dictionary from the system. In order to delete the dictionary, you must have object existence authority to 
the dictionary and object operational authorities to the library in which the dictionary is stored. 


When you delete a dictionary, make sure that you specify the correct library name. It is possible that many 
users have their own dictionaries, each named QUSRIGCDCT, stored in their libraries. If you do not 
specify any library name, the system deletes the first DBCS conversion dictionary in your library list. 


Japanese DBCS conversion: When you use DBCS displays to enter double-byte data, you may use the 
various data entry methods supported on the display, or you may choose to use the OS/400 DBCS 
conversion support. DBCS conversion lets you enter an alphanumeric entry or DBCS code and convert the 
entry or code to its related DBCS word. DBCS conversion is intended for Japanese character sets and its 
use is limited for application to other double-byte character sets. 


Specifically, DBCS conversion lets you convert the following: 

* Astring of alphanumeric characters to a DBCS word 

¢ English alphanumeric characters to double-byte alphanumeric characters 
¢ Alphanumeric Katakana to double-byte Hiragana and Katakana letters 
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¢ A DBCS code to its corresponding double-byte character 
¢ A DBCS number to its corresponding double-byte character 


Japanese system-supplied dictionary 


The QSYSIGCDCT is the system-supplied dictionary that is stored in the library, QSYS. It is a collection of 
entries with a Japanese pronunciation, expressed in alphanumeric characters, and the DBCS words 
related to those entries. The system checks this dictionary second when performing DBCS conversion. 


QSYSIGCDCT contains these entries: 
¢ Personal names 
— Family names 
— First names 
* Organization names 
— Private enterprises registered in the security market 
Public corporations 
Typical organizations in the central and local governments 
— Most universities and colleges 
* Addresses 
— Public administration units within the prefectures 
— Towns and streets in 11 major cities 


¢ Business terms, such as department names and position titles commonly used in enterprises 
¢ Individual double-byte characters, including basic double-byte characters, as defined by IBM 


You cannot add or delete entries from this dictionary. However, you may rearrange the related DBCS 
words so that the words used most frequently are displayed first during DBCS conversion. 


SQL and DBCS: The basic symbols of keywords and operators in the SQL language are single-byte 
characters that are part of all character sets supported by the IBM relational database products. 
Characters of the language are classified as letters, digits, or special characters. 


SQL host identifiers and double-byte characters 


A host-identifier is a name declared in the host program. The rules for forming a host-identifier are the 
rules of the host language, except that DBCS characters cannot be used. 


SQL character subtypes and double-byte characters 


Each character string is further defined as one of the following: 


¢ Bit data: Data that is not associated with a coded character set and is never converted. The CCSID for 
bit data is 65535. 


¢ SBCS data: Data in which every character is represented by a single byte. Each SBCS data character 
string has an associated CCSID. If necessary, an SBCS data character string is converted before it is 
used in an operation with a character string that has a different CCSID. 


¢ Mixed data: Data that may contain a mixture of characters from a single-byte character set (SBCS) and 
a double-byte character set (DBCS). Each mixed data character string has an associated CCSID. If 
necessary, a mixed data character string is converted before an operation with a character string that 
has a different CCSID. If mixed data contains a DBCS character, it cannot be converted to SBCS data. 


The database manager does not recognize subclasses of double-byte characters, and it does not assign 
any specific meaning to particular double-byte codes. However, if you choose to use mixed data, then two 
single-byte EBCDIC codes are given special meanings: 
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° X’0E’, the “shift-out” character, is used to mark the beginning of a sequence of double-byte codes. 
° X’OF’, the “shift-in’ character, is used to mark the end of a sequence of double-byte codes. 


In order for the database manager to recognize double-byte characters in a mixed data character string, 
the following condition must be met: 


¢ Within the string, the double-byte characters must be enclosed between paired shift-out and shift-in 
characters. 


The pairing is detected as the string is read from left to right. The code X’0E’ is recognized as a shift 
out character if X’OF’ occurs later; otherwise, it is invalid. The first X’OF’ following the X’0E’ that is on a 
double-byte boundary is the paired shift-in character. Any X’0OF’ that is not on a double-byte boundary is 
not recognized. 


There must be an even number of bytes between the paired characters, and each pair of bytes is 
considered to be a double-byte character. There can be more than one set of paired shift-out and 
shift-in characters in the string. 


The length of a mixed data character string is its total number of bytes, counting two bytes for each 
double-byte character and one byte for each shift-out or shift-in character. 


When the job CCSID indicates that DBCS is allowed, CREATE TABLE will create character columns as 
DBCS-Open fields, unless FOR BIT DATA, FOR SBCS DATA, or an SBCS CCSID is specified. The SQL 
user will see these as character fields, but the system database support will see them as DBCS-Open 
fields. 


For more information on SQL and DBCS, see the following: 

* SQL graphic strings 

* SQL assignments and comparisons 

* SQL conversion rules 

SQL graphic strings: A graphic string is a sequence of double-byte characters that does not include 
shift-out or shift-in characters. The length of the string is the number of its characters. Like character 
strings, graphic strings can be empty. 


Every graphic string has a CCSID that identifies a double-byte coded character set. If necessary, a graphic 
string is converted before it is used in an operation with a graphic string that has a different CCSID. 


SQL fixed-length and double-byte characters 


All values of a fixed-length graphic-string column have the same length, which is determined by the length 
attribute of the column. The length attribute must be between 1 through 16383 inclusive. 


SQL graphic-string constants 


A graphic-string constant is a varying-length graphic string. The length of the specified string cannot be 
greater than 16370. 


In the normal form, the SQL delimiters and the G or the N are SBCS characters. The SBCS apostrophe (’) 
is the EBCDIC apostrophe, X’7D’. 


In the PL/I form, the apostrophes and the G are DBCS characters. Two consecutive DBCS string 
delimiters are used to represent one string delimiter within the string. Notice that this PL/I form is only valid 
for static statements embedded in PL/I programs. 


A hexadecimal graphic constant is also supported. The form of the hexadecimal graphic constant is: 
GX'ssss' 
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In the constant, ssss represents a string from 0 to 32766 hexadecimal digits. The number of characters 
between the string delimiters must be an even multiple of 4. Each group of 4 digits represents a single 
graphic character. The hexadecimal for shift-in and shift-out (X’OE’ and X’OF’) are not included in the 
string. 


The CCSID assigned to constants is the DBCS CCSID associated with the CCSID of the source unless 
the source is encoded in a foreign encoding scheme (such as ASCIl). In this case, the CCSID assigned to 
the constant is the DBCS CCSID associated with the default CCSID of the application server when the 
SQL statement containing the constant is prepared. If there is no DBCS CCSID associated with the 
CCSID of the source, the CCSID is 65535. 


SQL assignments and comparisons: The basic operations of SQL are assignment and comparison. 
Assignment operations are performed during the running of CALL, INSERT, UPDATE, FETCH, and 
SELECT INTO statements. Comparison operations are performed during the running of statements that 
include predicates and other language elements such as MAX, MIN, DISTINCT, GROUP BY, and ORDER 
BY. 


The basic rule for both operations is that the data type of the operands involved must be compatible. The 
compatibility rule also applies to UNION, concatenation, and the VALUE, COALESCE, MIN, and MAX 
scalar functions. 


SQL string assignments and double-byte characters 


The basic rule for string assignments is that the length of a string assigned to a column must not be 
greater than the length attribute of the column. (Trailing blanks are normally included in the length of the 
string. For string assignment, however, trailing blanks are not included in the length of the string.) 


If the string contains mixed data, the assignment rules may require truncation within a sequence of 
double-byte codes. To prevent the loss of the shift-in character that ends the double-byte sequence, 
additional characters may be truncated from the end of the string, and a shift-in character added. In the 
truncated result, there is always an even number of bytes between each shift-out character and its 
matching shift-in character. 


Character, DBCS-only, DBCS-open, and DBCS-either are not compatible with graphic types for 
assignment. 


SQL conversion rules: When two strings are compared, one of the strings is first converted, if necessary, 
to the coded character set of the other string. Character conversion is necessary only if all of the following 
are true: 

* The CCSIDs of the two strings are different. 

* Neither CCSID is 65535. 

* The string selected for conversion is neither null nor empty. 

¢ The CCSID conversion selection table indicates that conversion is necessary. 


If one string has an SBCS CCSID and the other is the same type of operand and has a mixed data 
CCSID, the SBCS data character string is converted. Otherwise, the string selected for conversion 
depends on the type of each operand. The following table shows which operand is selected for 
conversion, given the operand types. 


Column value Derived value Special register | Constant Host variable 
(second (second (second (second (second 
First operand operand) operand) operand) operand) operand) 
Column value Second Second Second Second Second 
Derived Value First Second Second Second Second 
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Column value Derived value Special register | Constant Host variable 
(second (second (second (second (second 
First operand operand) operand) operand) operand) operand) 
Special Register First First Second Second Second 
Constant First First First Second Second 
Host Variable First First First First Second 


A host variable containing data in a foreign encoding scheme is always effectively converted to the native 
encoding scheme before it is used in any operation. The above rules are based on the assumption that 
this conversion has already occurred. 


An error occurs if a character of the string cannot be converted or the CCSID Conversion Selection Table 
is used but does not contain any information about the pair of CCSIDs. A warning occurs if a character of 


the string is converted to the substitution character. 


DBCS code schemes 
IBM supports two DBCS code schemes: one for the host systems, the other for personal computers. The 


DBCS code scheme for host systems has the following code-range characteristics: 
* First byte: hex 41 to hex FE 

* Second byte: hex 41 to hex FE 

* Double-byte blank: hex 4040 


Shift-control characters 


When the host code scheme is used, the system uses shift-control characters to identify the beginning and 
end of a string of double-byte characters. The shift-out (SO) character, hex OE, indicates the beginning of 
a double-byte character string. The shift-in (SI) character, hex OF, indicates the end of a double-byte 
character string. 


Each shift-control character occupies the same amount of space as one alphanumeric character. By 
contrast, double-byte characters occupy the same amount of space as two alphanumeric characters. 


When double-byte characters are stored in a graphic field or a variable of graphic data type, there is no 
need to use shift control characters to surround the double-byte characters. 


Incorrect and undefined double-byte code 

Incorrect double-byte code has a double-byte code value that is not in the valid double-byte code range. 
This is in contrast to undefined double-byte code where the double-byte code is valid, but no graphic 
symbol has been defined for the code. 


Supported DBCS code ranges 


OS/400 supports the following DBCS character-set code ranges: 
e Japanese character-set code range 

* Korean character-set code range 

* Simplified Chinese character-set code range 

* Traditional Chinese character-set code range 


See Appendix A (DBCS Code Scheme) in the ile Managemeni PDF for details. 
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DBCS font tables 
DBCS font tables contain the images of the double-byte extended characters used on the system. The 
system uses these images to display and print extended characters 


a 


when they are not resident on the device. 


% 


The following DBCS font tables are objects that you can save or restore. These font tables are distributed 
with the DBCS national language versions of the OS/400 licensed program: 


QIGC2424 
A Japanese DBCS font table used to display and print extended characters in a 24-by-24 dot 
matrix image. The system uses the table with Japanese displays, printers attached to displays, 
5227 Model 1 Printer, and the 5327 Model 1 Printer. 


QIGC2424C 
A Traditional Chinese DBCS font table used to print extended characters in a 24-by-24 dot matrix 
image. The system uses the table with the 5227 Model 3 Printer and the 5327 Model 3 Printer. 


QIGC2424K 
A Korean DBCS font table used to print extended characters in a 24-by-24 dot matrix image. The 
system uses the table with the 5227 Model 2 Printer and the 5327 Model 2 Printer. 


QIGC2424S 
A Simplified Chinese DBCS font table used to print extended characters in a 24-by-24 dot matrix 
image. The system uses the table with the 5227 Model 5 Printer. 


QIGC3232 
A Japanese DBCS font table used to print characters in a 32-by-32 dot matrix image. The system 
uses the table with the 5583 Printer and the 5337 Model 1 Printer. 


QIGC3232S 
A Simplified Chinese DBCS font table used to print characters in a 32-by-32 dot matrix image. The 
system uses the table with the 5337 Model RO5 Printer. 


All DBCS font tables have an object type of *IGCTBL. You can find instructions for adding user-defined 
characters to DBCS font tables in the ADTS/400: Character Generator Utility, SCO9-1769-00 book. 


DBCS font table commands 


The following commands allow you to manage and use DBCS font tables: 


* Check DBCS Font Table (CHKIGCTBL) 
* Copy DBCS Font Table (CPYIGCTBI)) 

* Delete DBCS Font Table (DLTIGCTBI 
¢ Start Font Management Aid (STREMA)| 


Locate an existing font table 


Use the Check DBCS Font Table (CHKIGCTBL)) command to find out if a DBCS font table exists in your 
system. 


For example, to find out if the table QIGC2424 exists, enter: 
CHKIGCTBL IGCTBL(QIGC2424) 


If the table does not exist, the system responds with a message. If the table does exist, the system simply 
returns without a message. 


124 iSeries: Globalization (Handle data in global applications) 


Check for the existence of a table when adding a new type of DBCS workstation to make sure that the 
table used by the device exists in the system. 


For more information 


For additional information, see the following: 
* Copy a DBCS font table 
* Delete a DBCS font table 


Copy a DBCS font table: Use the Copy DBCS Font Table [(CPYIGCTBL)| command to copy a DBCS font 
table to or from tape, diskette, or physical file. 


The DBCS font tables are saved when you use the Save System (SAVSYS)] command so you do not have 
to use the CPYIGCTBL command when performing normal system backup. 


A physical file used to save and restore table information must have a minimum record length of 74 bytes. 
Copying a table onto a tape, a diskette, or a physical file 


You should copy a DBCS font table onto a tape, a diskette, or a physical file in the following instances: 
* Before deleting that table 

¢ After new user-defined characters are added to the tables 

¢ When planning to use the tables on another system 


To copy a DBCS font table onto a tape, a diskette, or a physical file do the following: 


1. If copying a DBCS font table onto a tape or diskettes, make sure that the tape or diskettes are 
initialized to the *DATA format. If necessary, initialize the tape or diskettes by specifying the 
FMT(*DATA) parameter on the Initialize Diskette (INZDKT)) command. 


2. Load the initialized tape or diskette onto the system. 
3. Enter the command as follows: 
a. Choose the value OPTION(*OUT). 


b. Use the DEV parameter to select the device to which you want to copy the table. A value of *FILE 
specifies that the DBCS font table is saved to a physical file. 


c. Use the SELECT and RANGE parameters to specify which portion of the table you want copied 
from the system. For more information on the valid codes and numbers to specify for starting and 
ending values of user-defined character ranges, see the . 


4. Press the Enter key. The system copies the DBCS font table onto the specified medium or into a 
physical file. 


5. Remove the tape or the diskette after the system finishes copying the table. 
Copying a DBCS font table from a tape, a diskette, or a physical file 


Use the Copy DBCS Font Table (CPYIGCTBI command to copy a DBCS font table from a tape, a 
diskette, or a physical file onto the system. The system automatically creates the DBCS font table again 
when copying its contents if the following are true: 


* The specified table does not already exist in the system. 


* The medium or physical file from which you are copying the table contains all of the IBM-defined 
double-byte characters. 


¢ SELECT(*ALL) or SELECT(*SYS) is specified on the CPYIGCTBL command. 


Delete a DBCS font table: Use the Delete DBCS Font Table (DLTIGCTBLY command to delete a DBCS 
font table from the server. 
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Delete an unused DBCS font table to free storage space. For example, if you do not plan to use Japanese 
printer 5583 or 5337 with your server, font table QIGC3232 is not needed and can be deleted. 


When deleting a table, do the following: 

1. If desired, copy the table onto a tape, a diskette, or a physical file. If you do not copy the table before 
deleting it, you will not have a copy of the table for future use. 

2. Vary off all devices using that table. 

3. Enter the DLTIGCTBL command. For example, to delete the DBCS font table QIGC3232, enter: 
DLTIGCTBL IGCTBL(QIGC3232) 


4. Press the Enter key. The system sends an inquiry message to the system operator message queue for 
you to confirm your intention to delete a DBCS table. 


5. Respond to the inquiry message. The server sends you a message when it has deleted the table. 
Note: Do not delete a DBCS font table if any device using that table is currently varied on. Also, make 
sure that the affected controller is not varied on. If you try to delete the table while the device and 
controller are varied on, the system reports any devices attached to the same controller(s) as those 
devices, and the controller(s) as damaged the next time you try to print or display extended characters 
on an affected device. If such damage is reported, do the following: 

a. Vary off the affected devices, using the Vary Configuration [(VRYCEG) command. 

Vary off the affected controller. 

Vary on the affected controller. 

Vary on the affected devices. 

Continue normal work. 


DBCS font files 
In addition to the system-supplied DBCS font tables, the system also provides DBCS font files. These 


DBCS font files are physical files which contain frequently used double-byte characters. When using the 
character generator utility, you can use the characters in these files as the base for a new user-defined 

character. These files are supplied with read-only authority as they are not to be changed. If you do not 
use character generator utility or the Advanced DBCS Printer Support licensed program, you may delete 
these files to save space. They all exist in the QSYS library. 


oao fF 


The following DBCS font files are distributed with the DBCS national language versions of the OS/400 
licensed program. They are used as a reference for the CGU and the Advanced DBCS Printer Support 
licensed program. 


QCGF2424 
A Japanese DBCS font file used to store a copy of the Japanese DBCS basic character images. 


QCGF2424K 
A Korean DBCS font file used to store a copy of the Korean DBCS basic character images. 


QCGF2424C 
A Traditional Chinese DBCS font file used to store a copy of the Traditional Chinese DBCS basic 
character images. 


QCGF2424S 
A Simplified Chinese DBCS font file used to store a copy of the Simplified Chinese DBCS basic 
character images. 


DBCS sort tables 
DBCS sort tables contain the sort information and collating sequences of all the double-byte characters 


used on the system. The sort utility on the system uses these tables to sort double-byte characters. 
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DBCS sort tables are objects that you can save, restore and delete. Using the character generator utility 
you can also add, delete and change entries in these tables corresponding to the image entries in the 
DBCS font tables. For Japanese use only, you can also copy the DBCS master sort table to and from a 
data file. 


The following DBCS sort tables are distributed with the DBCS national language versions of OS/400 
licensed program: 


QCGMSTR 
A Japanese DBCS master sort table used to store the sort information for the Japanese 
double-byte character set. 


QCGACTV 
A Japanese DBCS active sort table used to store the sort collating sequences for the Japanese 
double-byte character set. 


QCGMSTRC 
A Traditional Chinese DBCS master sort table used to store the sort information for the Traditional 
Chinese double-byte character set. 


QCGACTVC 
A Traditional Chinese DBCS active sort table used to store the sort collating sequences for the 
Traditional Chinese double-byte character set. 


QCGACTVK 
A Korean DBCS active sort table used to map Hanja characters to Hangeul characters with 
equivalent pronunciation. 


QCGMSTRS 
A Simplified Chinese DBCS master sort table used to store the sort information for the Simplified 
Chinese double-byte character set. 


QCGACTVS 
A Simplified Chinese DBCS active sort table used to store the sort collating sequences for the 
Simplified Chinese double-byte character set. 


You can sort Japanese, Korean, Simplified Chinese, and Traditional Chinese double-byte characters. Each 
of these languages have two DBCS sort tables, a DBCS master sort table and a DBCS active sort table, 
except for Korean which has only a DBCS active sort table. The DBCS master sort table contains sort 
information for all defined DBCS characters. The DBCS active sort table for Japanese, Simplified Chinese, 
and Traditional Chinese is created from the master sort table information and contains the collating 
sequences for the double-byte characters of that given language. These collating sequences have a 
purpose similar to the EBCDIC and ASCII collating sequences for the single-byte alphanumeric character 
set. For Korean characters, the Hangeul characters are assigned both their collating sequence as well as 
their DBCS codes according to their pronunciation. Hence, a separate collating sequence is not required, 
and each of the Hanja characters is mapped to a Hangeul character of the same pronunciation using the 
DBCS active sort table QOGACTVK. 


All DBCS sort tables have an object type of *IGCSRT. 
Commands for DBCS sort tables 


The following commands allow you to manage and use DBCS sort tables. 


* Check Object (CHKOB.I) 
* Save Object (SAVOB.)) 
¢ Restore Object KRSTOB.)) 


Use existing DBCS sort tables 
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You can save the tables to tape or diskette, delete them from the server, and restore them to the server. 
The Japanese DBCS master sort table can also be copied to a data file and copied from a data file so that 
it can be shared with an Application System/Entry* (AS/Entry) system. You can also add sort information 
for each user-defined character, and add that character to the DBCS collating sequence, as you create it 
using the character generator utility. 


Find existing DBCS sort table 
Use the Check Object (CHKOB.) command to find out if a DBCS sort table exists in your system. 


For example, to find out if the table QOGMSTR exists, enter: 
CHKOBJ OBJ(QSYS/QCGMSTR) OBJTYPE(*IGCSRT) 


If the table does not exist, the system responds with a message. If the table does exist, the system simply 
returns without a message. 


Check for the existence of a DBCS active sort table when you want to sort double-byte characters for the 
first time. The DBCS active table for the DBCS language must exist to sort the characters. 


For additional information about DBCS sort tables, see the following: 
¢ Save and restore a DBCS sort table 
¢ Delete a DBCS sort table 


Save and restore a DBCS sort table: The following topics describe how you save and restore DBCS 
sort tables. 


Save a DBCS sort table to tape or diskette 


Save a DBCS sort table onto tape or diskette in the following instances: 

* Before deleting that table 

¢ After information is added, updated, or changed in the tables using the character generator utility 
¢ When planning to use the tables on another iSeries server 


Use the Save Object (SAVOBJ)] command to save a DBCS sort table onto tape or diskette. Specify 
*IGCSRT for the object type. 


The DBCS sort tables are saved when you use the command so you do not have to use the 
command when performing normal system backup. 


Restore a DBCS sort table from tape or diskette 


Use the IRSTOBRJ command to restore a DBCS sort table from a tape or a diskette onto the system. The 
tables on the tape or diskette must previously have been saved using the SAVOBJ command. Specify 
*IGCSRT for the object type. The system automatically re-creates the DBCS sort table when the specified 
table does not already exist in the system. 


These tables must be restored to the QSYS library for the system to know they exist. For that reason, 
RSTOB4J restores *IGCSRT objects only to the QSYS library and only if the objects do not already exist 
there. 


Delete a DBCS sort table: Use the DLTIGCSRT command to delete a DBCS sort table from the system. 
You can delete an unused DBCS sort table to free disk space, but you should always first save a copy of 
the table using the ISAVOB.] command. You should delete the DBCS master sort table for a DBCS 


language if either of the following are true: 
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You will not be creating any new characters for that language using the character generator utility. 
You will not be using the sort utility to sort characters for that language. 


You should delete the DBCS active sort table for a DBCS language if you will not be using the sort utility 
to sort characters for that language. The DBCS active sort table must be on the system to use the sort 
utility for this language. 


When deleting a table, do the following: 


1. 


If desired, save the table onto tape or diskettes. If you do not save the table onto removable media 
before deleting it, you will not have a copy of the table for future use. 


2. Enter the DLTIGCSRT command. For example, to delete the DBCS sort table QOGACTYV, enter: 
DLTIGCSRT IGCSRT(QCGACTV) 
3. Press the Enter key. The system sends you a message when it has deleted the table. 


DBCS field definition 
Consider the characteristics of DBCS data when defining a DBCS field: 


Each DBCS character is 2 bytes long. 
The length of a DBCS character string is always even. 


Shift-out (SO) and shift-in (SI) control characters are required at the beginning and end of the DBCS 
character string, except for graphic-data type fields. Together, these characters are 2 bytes long. 


The system treats DBCS data the same as character data, and therefore cannot perform arithmetic 
operations on it. 


The following DBCS data types can be used to identify DBCS fields: 

— J (DBCS-only) for fields that can contain only bracketed DBCS data. 

— E (DBCS-either) for fields that can contain bracketed DBCS or SBCS data, but not both. 
— O(DBCS-open) for fields that can contain both SBCS and bracketed DBCS data. 

— G (DBCS-graphic) for fields that can contain graphic data without the SO and SI control characters. 


Note: Data type O is allowed in all types of files. Data types J and E are allowed only in database and 
display files. Data type G is allowed in database, display, and printer files. In most cases, the OS/400 
automatically inserts shift-out and shift-in characters. An exception is when data is written into a data 


type G field in a database file. 


For more information on the DBCS data types, see the IDDS Reference: Concepts topic. 


Work with locales 


Locales are used rimarily in ILE-based application programs. Additionally, the Retrieve Locale Information 
(OPM, ILE, QlgRetrieveLocalelInformation) API retrieves one or all categories of a locale. See 


the [0S/400 API] topic for more information. 


Benefits of using locales in global applications 


Applications can be created independent of language, cultural data, or specific characters. Locales can be 


accessed to provide this type of support to any integrated language environment-based application. 


For example, the LC_TIME category within a locale can be defined in any of the following ways, or in any 


combination that is convenient for the environment in which the application runs: 
¢ HH:MM:SS 
¢ MM:SS:HH 
¢ SS:MM:HH 


Creating locales 
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Locales are created using the Create Locale (CRTLOCALE) command. 


2 
The source file used to create the locale is named QLOCALESRC, in the QSYSLOCALE library. This 
library is loaded with option 21 of the operating system. 


% 


These source files cannot be changed. Instead, they must be copied and then edited if changes are 
desired. 


. To see how to 


For a list of source definition files, see 
use the CRTLOCALE command, see 


Working with locales 


The following topics provide more detailed information about how you can use locales: 
* Locale restrictions 

* Locale categories 

* Locale symbolic names 

* Examples: Locale programming 


For more information 


The following links provide additional information about locales: 
* Install and enable locales 
¢ System-supplied locales and recommended CCSIDs 


Locale restrictions 
The following list contains restrictions when using locales to set job attributes: 


* The locale CCSID must be an EBCDIC single-byte CCSID for an SBCS system. 

¢ The locale CCSID must be an EBCDIC, single-byte character set (SBCS), or mixed-byte CCSID for a 
DBCS-capable system. 

¢ The locale object must exist in the QSYS file system. 

¢ The DATFMT, DATSEP, TIMSEP, and DECFMT parameters within the locale must be valid values 
supported as job attributes. See the WT inacemaal topic for more information on jobs and their 
attributes. 


¢ lf you want sort sequence support from the locale, you must use the CPYSYSCOL keyword. See 
CPYSYSCol for more detailed information. 


Locale categories 
The following categories are supported on OS/400. 


Locale category | Description 


[.C_COLLATE Defines character or string collation information 


[LC_CTYPEI Defines character classification, case conversion, and other character attributes. 


_M AGES | Defines the format for affirmative and negative responses. 


[LC_MONETARY| | Defines rules and symbols for formatting monetary numeric information. 


NUMER Defines a list of rules and symbols for formatting non-monetary numeric information. 
Lc Time Defines a list of rules and symbols for formatting time and date information. 
Lc_Too) Defines rules for daylight savings time and time zone information. 
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Note: A locale source file cannot contain duplicate categories. 
Locale category source definitions: 


The category source definition consists of: 
* The category header (category name), where the category name must be all uppercase characters. 


¢ The associated keyword/value pairs that comprise the category body. Keywords may be all uppercase, 
all lowercase, or mixed case characters. 


* The category trailer (which consists of END category-name) 


For example: 


LC_CTYPE 
source for LC_CTYPE category 
END LC_CTYPE 


Lines preceding the first category header can be used to change the comment character and the escape 
character. The comment_char (the default is #) and escape_char (the default is \) keywords can be used 
to change these characters. The following example shows how to change the comment character and 
escape character to * and / respectively: 


comment_char <asterisk> 
escape_char <slash> 


Note: This example uses symbolic names to represent the ’*’ and ’/ characters. 


The source for all categories is specified using the following: 


Keywords 
Each keyword identifies either a definition or a rule. The remainder of the statement containing the 
keyword contains the operands to the keyword. Operands are separated from the keyword by one 
or more blank characters. A statement may be continued on the next line by placing an 
escape_char as the last character before the newline or linefeed character that ends the line. 


Lines containing the comment_char in the first column are treated as comment lines. Comment 
lines cannot be continued on a subsequent line using an escape character. \ is the default escape 
character. However, the escape character can be defined to be any character by the user. 


Strings 
Strings must be enclosed in double-quotes. Double quotes within strings can be represented in 
two ways: 


* Literally. The escape character can be followed by double quotes. 
* Asymbolic name. For example, <quotation-mark>. 


A string can be continued on the next line by placing an escape_char as the last character before 
the newline or linefeed character that ends the line. 


A string is a sequence of character symbols, or literals enclosed by double-quotation (“’) 
characters. For example: 


"<A><B><C>" 


Character literals 
A character literal is the character itself. 


Character symbols 
A character symbol begins with the < (less-than) character, followed by non-control characters, 
and ends with the > (greater-than) character. For example, <A> is a valid character symbol 
(symbolic name). Any character symbol referenced in the source file should be one of the 
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predefined system-supplied symbols. The system supplied symbolic names are in the source file 
member QLGSYMBOL in the QLOCALESRC source file in the optionally installable library 
QSYSLOCALE. 


es for a list of all system-supplied symbolic names. 


In the event that the system does not contain a predefined symbolic name for a character, the 
UCS-2 level-1 format is allowed. The UCS-2 format is based on the character set defined in 
ISO/IEC 10646. The UCS-2 format may also be used in place of the predefined symbolic names. 
The following is an example of the UCS-2 symbolic name format: 


<UXXXX> 


where ’xxxx’ are four hexadecimal digits. For example, <U0041>. The hexadecimal number 0041 
within this symbolic name is the UCS-2 code point that represents the character ’A’. 


Each category must be explicitly defined in a locale definition source file is required. 


See the [Example: POSIX locale) for a complete description of each locale category included in the POSIX 
locale. 


LC_COLLATE category: The LC_COLLATE category defines character or string collation information. 
Within LC_COLLATE you can specify a sort sequence to use using the cpysyscol keyword. The cpysyscol 
keyword value is used in place of the LC_COLLATE category definitions. 


A collation element is the unit of comparison for collation. A collation element may be a character or a 
sequence of characters. Every collation element in the locale has a set of weights, which determine if the 
collation element collates before, equal to, or after the other collation elements in the locale. Each collation 
element is assigned collation weights by the CRTLOCALE command when the locale definition source file 
is created. These collation weights are then used by applications programs that compare strings. 


Every character defined in the CCSID that is specified in the CRTLOCALE command is itself a collating 
element. Additional collating elements can be defined using the collating-element statement. The syntax is: 


collating-element character-symbol from string 


The LC_COLLATE category begins with the LC_COLLATE keyword and ends with the END LC_COLLATE 
keyword. 


The following keywords are recognized in the LC_COLLATE category: 


cpysyscol 
This statement specifies that a system collating sequence table is to be used for the collation 
information for the category. If the locale is intended to be used to set the sort sequence table for 
the job, then it is required that the CPYSYSCOL keyword be used. If the CPYSYSCOL keyword is 
specified, no other keyword may be specified. The syntax for the CPYSYSCOL keyword is: 


CPYSYSCOLsort sequence path name;langid 


The sort sequence path name is a string specifying a fully expanded path name of an existing sort 
sequence table to use as the definition for this category. The path name delimiter must be a slash 
(/). Other valid values are strings containing one of the following: 


*JOB_ The sort sequence of the job. 


*LANGIDUNQ 
The unique-weighted sort sequence table that is associated with the language identifier 
requested parameter. 


*LANGIDSHR 
The shared-weighted sort sequence table that is associated with the language identifier 
requested parameter. 
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*HEX The sort sequence according to the hexadecimal value of the characters. 

The /langid is a string specifying the language identifier of the sort sequence table to be used. All 
langids must be in uppercase. Valid values are strings containing one of the following: 

*JOB Use the language identifier of the job. 


language id 
A valid 3- character language identifier. For example, Danish would be DAN. See 
Ds! for a complete list of valid language 


identifiers. 


Collating-element 
The collating-element statement specifies multi-character collating elements. The syntax for the 
collating-element statement is: 


collating-element symbolic-name from string 


The symbolic-name value defines a collating element that is a string of one or more characters as 
a single collating element. The symbolic-name value cannot duplicate any system predefined 
symbolic name, or any other symbolic name defined in this collation definition. The string value 
specifies a string of two or more characters or character symbols that define the symbolic-name 
value. The following are examples of the syntax for the collating-element statement: 
collating-element <ch> from "<c><h>" 


collating-element <e-acute> from "<acute><e>" 
collating-element <11> from "<1><1>" 


A symbolic-name value defined by the collating-element statement is recognized only with the 
LC_COLLATE category. 


Order_start 
The order_start statement may be followed by one or more collation order statements, assigning 
collation weights to collating elements. This statement is required. The syntax for the order_start 
statement is: 


order_start sort-rules;sort-rules;...sort-rules collation-order-statements order_end 


The sort-rules have the following syntax: 


directive, directive,...directive 
where directive is one of the directives; forward, backward, and position. 


The sort-rules directives are optional. If present, they define the rules to apply during string 
comparison. The number of specified sort-rules directives defines the number of weights each 
collating element is assigned (that is, the number of collation orders in the locale). If no sort-rules 
directives are present, one forward directive is assumed. 


If present, the first sort-rules directive applies when comparing strings using primary weight, the 
second when comparing strings using the secondary weight, and so on. Each set of sort-rules 
directives is separated by a ; (semicolon). A sort-rules directive consists of one or more 
comma-separated directives. The following directives are supported: 


Forward 
Specifies that collation weight comparisons proceed from the beginning of a string toward 
the end of the string. 


Backward 
Specifies that collation weight comparisons proceed from the end of a string toward the 
beginning of the string. 
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Position 
Specifies that collation weight comparisons consider the relative position of non-ignored 
elements in the string. That is, if strings compare equal, the element with the shortest 
distance from the starting point of the string collates first. 


The forward and backward directives are mutually exclusive. The following is an example of the 
syntax for the sort-rules directives: 


order_start forward;backward 


Order_end 
This keyword ends collating order entries introduced by the order_start keyword. 


The order of the characters and elements specified between the order_start and order_end 
keywords defines the character order used in range expressions and regular expressions. If no 
weights are assigned to the characters, then the character order also becomes the collation 
sequence weight. 


Special symbols 


Special symbols are required to be all upper case characters. The following special symbols can be used 
in the LC_COLLATE category: 


* IGNORE 


The optional operands for each collation element are used to define the primary, secondary, or 
subsequent weights for the collating element. The special symbol IGNORE is used to indicate a 
collating element that is to be ignored when strings are compared. 

¢ UNDEFINED 
All characters in the character set must be placed in the collation order, either explicitly or implicitly, by 
using the Undefined symbol. The UNDEFINED symbol includes all coded character set values not 
specified explicitly. These characters are inserted in the character collation order at the point indicated 
by the Undefined symbol in the order of their character code page values. If a collating weight is not 
explicitly specified for the UNDEFINED symbol, then by default, all of the undefined characters are 
assigned the same collating weight equal to the relative order of the first undefined character in the 
collating sequence. If no UNDEFINED special symbol exists and the collation order does not specify all 
collation elements from the coded character set, a warning is issued and all undefined characters are 
placed at the end of the character collation order and be given the same collating weight. 


Example 1: 


The following is an example of a collation order statement in the LC_COLLATE locale definition source file 
category. 


The text below the LC_COLLATE keywords has been added for clarity and does not appear in the locale 
source file. 


order_start forward;backward 
The order_start has two sort rules specified: 
# forward and backward 


UNDEFINED IGNORE; IGNORE 


# The UNDEFINED special symbol indicates that 

# all characters in the CCSID of the locale 

# that are not specified in the definition 

# are ignored for collation purposes. 

<LOW> 

# <LOW> is a collating symbol that is ordered 

# after all undefined characters. For example, if there 
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# were only two undefined characters, then the <LOW> symbol 
# would be third in the order. 


# All collating elements between <space> and <a> have the 
# same primary equivalence class and individual secondary 
# weights based on their coded character set values. 


<a> <a>3<a> 
<a-acute> <a>;<a-acute> 
<a-grave> <a>;<a-grave> 
<A> <a>; <A> 

<A-acute> <a>;<A-acute> 
<A-grave> <a>;<A-grave> 


# All characters between <a> and <A-grave> belong to the 

# same primary equivalence class because they have the same 
# primary weight. 

<ch> <ch>3;<ch> 

<Ch> <ch>;<Ch> 

# The <c><h> multi-character collating element is 

# represented by the <ch> collating symbol and belongs to the 
# same primary equivalence class as the <Ch> multi-character 
# collating element. 

<s> <S>3<s> 

<eszet> "<g><s>"3<s> 

# A one-to-many mapping is indicated by the <eszet> 

# character collated as an <s><s> string. That is, one 

# <eszet> character is expanded to <s><s> characters before 
# comparing. 

<HIGH> 

order_end 

Example 2: 


Following is an example of a CPYSYSCOL statement in the LC_COLLATE locale definition source file 
category. 
LC_COLLATE 


CPYSYSCOL "//QSYS.LIB//QLA10025S.TBL"; "ENU" 


END LC_COLLATE 


LC_CTYPE category: The LC_CTYPE category defines character classification, case conversion, and 
other character attributes. 


The LC_CTYPE category begins with an LC_CTYPE category header and ends with an END LC_CTYPE 
category trailer. 


All operands for LC_CTYPE category statements are defined as lists of characters. Each list consists of 
one or more semicolon-separated characters or symbolic character names. 


The following keywords are recognized in the LC_CTYPE category. In the descriptions, the term 
automatically included means that an error does not occur if the referenced characters are included or 
omitted. The characters are provided if they are missing and are accepted if they are present. In the event 
that the automatically included characters do not exist in the CCSID that you want to create the locale, a 
warning is issued by the CRTLOCALE command. 


upper Defines uppercase letter characters. No character defined by the cnirl, digit, punct, or space 
keyword can be specified. At a minimum, the uppercase letters A through Z are automatically 
included. 
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lower 


alpha 


digit 


space 


cntrl 


punct 


graph 


print 


xdigit 


blank 


Defines lowercase letter characters. No character defined by the cnitrl, digit, punct, or space 
keyword can be specified. At a minimum, the lowercase letters a through z are automatically 
included. 


Defines all letter characters. No character defined by the cnirl, digit, punct, or space keyword can 
be specified. Characters defined by the upper and lower keywords are automatically included in 
this character class. 


Defines numeric digit characters. Only the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9 can be specified. 


Defines white space characters. No character defined by the upper, lower, alpha, digit, graph, or 
xdigit keyword can be specified. At a minimum, the <space>, <form-feed>, <newline>, <carriage 
return>, <tab>, <vertical-tab> characters, and any characters defined by the blank keyword, are 
automatically included. 


Defines control characters. No character defined by the upper, lower, alpha, digit, punct, graph, 
print, or xdigit keywords can be specified. 


Defines punctuation characters. A character defined as the <space> character and characters 
defined by the upper, lower, alpha, digit, cntrl, or xdigit keyword cannot be specified. 


Defines printable characters, excluding the <space> character. If this keyword is not specified, 
characters defined by the upper, lower, alpha, digit, xdigit, and punct keywords are automatically 
included in this character class. No character defined by the cntrl keyword can be specified. 


Defines printable characters, including the <space> character. If this keyword is not specified, the 
<space> character and characters defined by the upper, lower, alpha, digit, xdigit, and punct 
keywords are automatically included in this character class. No character defined by the cntrl 
keyword can be specified. 


Defines hexadecimal digit characters. Only the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9 and the letters 
A, B, C, D, E, F, a, b, c, d, e, and f can be specified. If not specified, the xdigit class defaults to 
the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9 and the letters A, B, C, D, E, F, a, b, c, d, e, and f. 


Defines blank characters. If this keyword is not specified, the <space> and <horizontal-tab> 
characters are included in this character class. 


toupper 


tolower 


Defines the mapping of lowercase characters to uppercase characters. Operands for this keyword 
consist of semicolon-separated character pairs. Each character pair is enclosed in ( ) 
(parentheses) and separated from the next pair by a , (comma). The first character in each pair is 
considered lowercase; the second character is considered uppercase. Only characters defined by 
the lower and upper keywords can be specified. 


Defines the mapping of uppercase characters to lowercase characters. Operands for this keyword 
consist of semicolon-separated character pairs. Each character pair is enclosed in ( ) 
(parentheses) and separated from the next pair by a , (comma). The first character in each pair is 
considered uppercase; the second character is considered lowercase. Only characters defined by 
the lower and upper keywords can be specified. 


Note: The tolower keyword is optional. If this keyword is not specified, the mapping defaults to 
the reverse mapping of the toupper keyword, if specified. If the toupper keyword is not specified, 
the mapping defaults to the C locale. 


Example: 


The following is an example of a LC_CTYPE category in a locale definition source file: 
LC_CTYPE 


#"alpha" is by default "upper" and "lower" 
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#"print" is by default "alpha", "digit", "punct", and the space character 
#"graph" is by default "alnum" and "punct" 
#"tolower" is by default the reverse mapping of "toupper" 


# 

upper <A>;<B>;<C>;<D>;<E>;<F>;<G>3;<H>;<I>3;<J>;<K>;<L>;<M>;\ 
<N>3<0>;<P>;<Q>;<R>;<S>3<T>;<U>3;<V>;<W>;<X>;<Y>3;<Z> 

# 


lower <a>; <b>;<c>;<d>;<e>;<f>;<g>;<h>;<i>;<j>;<k>3;<1>;<m>;\ 
<n>3;<0>3;<p>3;<q>3<r>3<S>3<t>3<u>3;<v>3<w>3;<x>3<y>3;<z> 

# 

digit <zero>;<one>;<two>;<three>;<four>;<five>;<six>;\ 
<seven>;<eight>;<nine> 

# 

Space <tab>;<newl ine>;<vertical-tab>;<form-feed>; \ 
<carriage-return>;<space> 

# 


cntrl <alert>;<backspace>;<tab>;<newl ine>;<vertical-tab>; \ 
<form-feed>;<carriage-return>;<NUL>;<SOH>;<STX>; \ 
<ETX>;<EOT>;<ENQ>;<ACK>;<S0>;<SI>;<DLE>;<DC1>;<DC2>; \ 
<DC3>;<DC4>;<NAK>;<SYN>3;<ETB>;<CAN>;<EM>;<SUB>; \ 
<ESC>3;<IS4>;<IS3>;<IS2>;<IS1>;<DEL> 

# 

punct <exclamation-mark>;<quotation-mark>;<number-sign>; \ 
<dollar-sign>;<percent-sign>;<ampersand>;<asterisk>; \ 
<apostrophe>;<left-parenthesis>;<right-parenthesis>; \ 
<plus-sign>;<comma>;<hyphen>;<period>;<slash>;\ 
<colon>;<semicolon>;<less-than-sign>;<equals-sign>; \ 
<greater-than-sign>;<question-mark>;<commercial-at>; \ 
<left-square-bracket>;<backslash>;<circumflex>; \ 
<right-square-bracket>;<under]l ine>;<grave-accent>; \ 
<left-curly-bracket>;<vertical-line>;<tilde>;\ 
<right-curly-bracket> 

# 


xdigit <zero>;<one>;<two>;<three>;<four>;<five>;<six>;\ 

<seven>;<eight>;<nine>;<A>;<B>;<C>;<D>;<E>;<F>; \ 

<a>;<b>;<c>;<d>;<e>;<f> 

# 

blank <space>;<tab> 

# 

toupper (<a>,<A>) ; (<b>,<B>) ; (<c>,<C>) ; (<d>,<D>) ; (<e>,<E>);\ 
(<f>,<F>) ; (<g>,<G>) ; (<h>,<H>) ; (<i>,<I>) 3 (<j>,<J>) 3\ 
<k>,<K>) 3 (<1>,<L>) ; (<m>,<M>) ; (<n>,<N>) ; (<o>,<0>);\ 
<p>, <P>) ; (<q>,<Q>) ; (<r>,<R>) ; (<s>,<S>) ; (<t>,<T>) 3 \ 
<u>, <U>) ; (<v>,<V>) 5 (<w>,<W>) 5 (<x>,<X>) 3 (<y>,<Y>)3\ 


LC_MESSAGES category: The LC_MESSAGES category of a locale definition source file defines the 
format for affirmative and negative system responses. This category begins with an LC_MESSAGES 
category header and ends with an END LC_MESSAGES category trailer. 


All operands for the LC_MESSAGES category are defined as strings or extended regular expressions 
enclosed by double-quotation marks (""). 


Note: For additional information, see the EEx x i topic, below. These 
operands are separated from the keyword they define bye one or more blanks. Two adjacent 
double-quotation marks ("”) indicate an undefined value. The following keywords are recognized in the 
LC_MESSAGES category: 


yesexpr 
Specifies an extended regular expression that describes the acceptable affirmative response to a 
question expecting an affirmative or negative response. 
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noexpr 


Specifies an extended regular expression that describes the acceptable negative response to a 
question expecting an affirmative or negative response. 


yesstr A fixed string of acceptable affirmative response. 


nostr_ A fixed string of acceptable negative response. 


Extended regular expressions: The following special characters are used to form extended regular 


expressions: 


Character 
+ 


() 
{m} 


{m,} 


{m, n} 


[String] 


[’ String] 


. (period) 


* (asterisk) 
\ (backslash) 


Function 

Specifies that a string matches if one or more occurrences 
of the character or extended regular expression that 
precedes the + (plus) are within the string. 

Specifies that a string matches if zero or one occurrences 
of the character or extended regular expression that 
precedes the ? (question mark) are within the string. 
Specifies that a string matches if either of the strings 
separated by the | (vertical line) are within the string. 
Groups strings together in regular expressions. 

Specifies that a string matches if exactly m occurrences of 
the pattern are within the string. 

Specifies that a string matches if at least m occurrences 
of the pattern are within the string. 

Specifies that a string matches if between m and n, 
inclusive, occurrences of the pattern are within the string ( 
where m <= n). 

Signifies that the regular expression matches any 
characters specified by the string variable within the 
square brackets. 

A % (caret) within the [ ] (square brackets) and at the 
beginning of the specified string indicates that the regular 
expression does not match any characters within the 
square brackets. 

Signifies the beginning of a field or record. 

Signifies the end of a field or record. 

Signifies any one character except the terminal new-line 
character at the end of a space. 

Signifies zero or more of any characters. 

The escape character. When preceding any of the 
characters that have special meaning in extended regular 
expressions, the escape character removes any special 
meaning for the character. 


Character class expressions may also be specified in the extended regular expression. The following 
character class expressions are supported in all locales: 


[:alnum:] 
[:alpha:] 
[:blank:] 
[:entr1:] 
[:digit:] 
[:graph:] 
[: lower:] 
[:print:] 
[:punct:] 
[:space:] 
[:upper:] 
[:xdigit:] 
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Example: 


The following is an example of a LC_MESSAGES category in a locale definition source file: 
LC_MESSAGES 

# 

yesexpr "[yy]" 

noexpr "[nN]" 

yesstr "yes" 

nostr "no" 

# 

END LC_MESSAGES 


LC_MONETARY category: The LC_MONETARY category of a locale definition source file defines rules 
and symbols for formatting monetary numeric information. This category begins with an LC_MONETARY 
category header and ends with an END LC_MONETARY category trailer. 


All operands for the LC_MONETARY category keywords are defined as string or integer values. String 
values are bounded by double-quotation marks (“’). All values are separated from the keyword they define 
by one or more spaces. Two adjacent double-quotation marks indicate an undefined string value. A -1 
indicates an undefined integer value. The following keywords are recognized in the LC_MONETARY 
category: 


int_curr_symbol 
Specifies the string used for the international currency symbol. The operand for the 
int_curr_symbol keyword is a four-character string. The first three characters contain the 
alphabetic international-currency symbol. The fourth character specifies a character separator 
between the international currency symbol and a monetary quantity. 


currency_symbol 
Specifies the string used for the local currency symbol. 


mon_decimal_point 
Specifies the string used for the decimal delimiter used to format monetary quantities. 


mon_thousands_sep 
Specifies the string used for grouping digits to the left of the decimal delimiter in formatted 
monetary quantities. 

mon_grouping 
Defines the size of each group of digits in formatted monetary quantities. The operand for the 
mon_grouping keyword consists of a sequence of semicolon-separated integers. Each integer 
specifies the number of digits in a group. The initial integer defines the size of the group 
immediately to the left of the decimal delimiter. The following integers define succeeding groups to 
the left of the previous group. If the last digit is not -1, subsequent grouping is performed using the 
previous digit. If the last digit is -1, grouping is only performed for the number of groups specified. 


The following is an example of the interpretation of the mon_grouping keyword. Assuming the 
value to be formatted is 123456789 and the operand for the mon_thousands_sep keyword is 
comma (,), the following results occur: 


mon_grouping Value 
Formatted Value 


35-1 123456,789 
3 123,456,789 
3;2 12,34,56,789 
3325-1 1234,56,789 
positive_sign 
Specifies the string used to indicate a nonnegative-valued formatted monetary quantity. 
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negative_sign 
Specifies the string used to indicate a negative-valued formatted monetary quantity. 


int_frac_digits 
Specifies an integer value representing the number of fractional digits (those after the 
decimal delimiter) to be displayed in a formatted monetary quantity using the 
int_curr_symbol value. 


frac_digits 
Specifies an integer value representing the number of fractional digits (those after the 
decimal delimiter) to be displayed in a formatted monetary quantity using the 
currency_symbol value. 


p_cs_precedes 
Specifies an integer value indicating whether the int_curr_symbol or currency_symbol 
string precedes or follows the value for a non-negative formatted monetary quantity. The 
following integer values are recognized: 


0 Indicates that the currency symbol follows the monetary quantity. 


1 Indicates that the currency symbol precedes the monetary quantity. 


p_sep_by_space 
Specifies an integer value indicating whether the int_curr_symbol or currency_symbol 
string is separated by a space from a non-negative formatted monetary quantity. The 
following integer values are recognized: 


0 Indicates that no space separates the currency symbol from the monetary quantity. 

1 Indicates that a space separates the currency symbol from the monetary quantity. 

2 Indicates that a space separates the currency symbol and the positive_sign string, 
if adjacent. 


n_cs_precedes 
Specifies an integer value indicating whether the int_curr_symbol or currency_symbol 
string precedes or follows the value for a negative formatted monetary quantity. The 
following integer values are recognized: 


0 Indicates that the currency symbol follows the monetary quantity. 


1 Indicates that the currency symbol precedes the monetary quantity. 


n_sep_by_space 
Specifies an integer value indicating whether the int_curr_symbol or currency_symbol 
string is separated by a space from a negative formatted monetary quantity. The following 
integer values are recognized: 


0 Indicates that no space separates the currency symbol from the monetary quantity. 
1 Indicates that a space separates the currency symbol from the monetary quantity. 
2 Indicates that a space separates the currency symbol and the negative_sign 


string, if adjacent. 


p_sign_posn 
Specifies an integer value indicating the positioning of the positive_sign string for a 
non-negative formatted monetary quantity. The following integer values are recognized: 
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0 Indicates that parenthesis enclose both the monetary quantity and the 
int_curr_symbol or currency_symbol string. 


1 Indicates that the positive_sign string precedes the quantity and the 
int_curr_symbol or currency_symbol string. 


2 Indicates that the positive_sign string follows the quantity and the int_curr_symbol 
or currency_symbol string. 


3 Indicates that the positive_sign string immediately precedes the int_curr_symbol or 
currency_symbol string. 


4 Indicates that the positive_sign string immediately follows the int_curr_symbol or 
currency_symbol string. 


n_sign_posn 
Specifies an integer value indicating the positioning of the negative_sign string for a 
negative formatted monetary quantity. The following integer values are recognized: 


0 Indicates that parenthesis enclose both the monetary quantity and the 
int_curr_symbol or currency_symbol string. 


1 Indicates that the negative_sign string precedes the quantity and the 
int_curr_symbol or currency_symbol string. 


2 Indicates that the negative_sign string follows the quantity and the int_curr_symbol 
or currency_symbol string. 


3 Indicates that the negative_sign string immediately precedes the int_curr_symbol 
or currency_symbol string. 


4 Indicates that the negative_sign string immediately follows the int_curr_symbol or 
currency_symbol string. 


Examples: 


The following is an example of the LC_MONETARY category listed in a locale definition source file: 
LC_MONETARY 


# 
int_curr_symbol "<Y><S><D>" 
currency_symbol "<dollar-sign>" 


mon_decimal_point '"<period>" 
mon_thousands_sep "<comma>" 


mon_grouping 3;-1 
positive_sign "<plus-sign>" 
negative_sign "<hyphen>" 


int_frac_digits 
frac_digits 
p_cs_precedes 
p_sep_by_space 
n_cs_precedes 
n_sep_by_space 
p_sign_posn 
n_sign_posn 

# 

END LC_MONETARY 


WWNR NF PH P 


3 for another example relating to monetary formats. 
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LC_NUMERIC category: Defines rules and symbols for formatting non-monetary numeric information. 


The LC_NUMERIC category of a locale definition source file defines rules and symbols for formatting 
non-monetary numeric information. This category begins with an LC_NUMERIC category header and 
terminates with an END LC_NUMERIC category trailer. 


All operands for the LC_NUMERIC category keywords are defined as string or integer values. String 
values are bounded by double-quotation marks (””). All values are separated from the keyword they define 
by one or more spaces. Two adjacent double-quotation marks indicate an undefined string value. A -1 
indicates an undefined integer value. The following keywords are recognized in the LC_NUMERIC 
category: 


decimal_point 
Specifies a string containing the decimal delimiter character used to format numeric, non-monetary 
quantities. 


thousands_sep 
Specifies the string separator used for grouping digits to the left of the decimal delimiter in 
formatted numeric, non-monetary quantities. 


grouping 
Defines the size of each group of digits in formatted monetary quantities. The operand for the 
grouping keyword consists of a sequence of semicolon-separated integers. Each integer specifies 
the number of digits in a group. The initial integer defines the size of the group immediately to the 
left of the decimal delimiter. The following integers define succeeding groups to the left of the 
previous group. Grouping is performed for each integer specified for the grouping keyword. If the 
last digit is not -1, subsequent grouping is performed using the previous digit. If the last digit is -1, 
grouping is only performed for the number of groups specified. 


The following is an example of the interpretation of the grouping statement. Assuming the value to be 
formatted is 123456789 and the operand for the thousands_sep keyword is comma (,), the following 
results occur: 


Grouping value Formatted value 
3 123,456,789 

3:-1 123456,789 

3;2 12,34,56,789 
3:2--4 1234,56,789 
Example: 


Following is an example of a LC_NUMERIC category in a locale definition source file: 


LC_NUMERIC 

# 

decimal_point "<period>" 
thousands_sep "<comma>" 
grouping 3;-1 

# 


END LC_NUMERIC 


LC_TIME category: The LC_TIME category of a locale definition source file defines rules and symbols 
for formatting time and date information. This category begins with an LC_TIME category header and 
terminates with an END LC_TIME category trailer. 


All operands for the LC_TIME category keywords are defined as string or integer values. String values are 


bounded by double-quotation marks (""). All values are separated from the keyword they define by one or 
more spaces. Two adjacent double-quotation marks indicate an undefined string value. A -1 indicates an 
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undefined integer value. Field descriptors are used by commands and subroutines that query the LC_TIME 
category to represent elements of time and date formats. The following keywords are recognized in the 
LC_TIME category: 


abday 


day 


abmon 


mon 


Defines the abbreviated weekday names corresponding to the %a field descriptor. Recognized 
values consist of seven semicolon-separated strings. The first string corresponds to the 
abbreviated name for the first day of the week (Sun), the second to the abbreviated name for the 
second day of the week, and so on. 


Defines the full spelling of the weekday names corresponding to the %A field descriptor. 
Recognized values consist of seven semicolon-separated strings. The first string corresponds to 
the full spelling of the name of the first day of the week (Sunday), the second to the name of the 
second day of the week, and so on. 


Defines the abbreviated month names corresponding to the %b field descriptor. Recognized values 
consist of 12 semicolon-separated strings. The first string corresponds to the abbreviated name for 
the first month of the year (Jan), the second to the abbreviated name for the second month of the 
year, and so on. 


Defines the full spelling of the month names corresponding to the %B field descriptor. Recognized 
values consist of 12 semicolon-separated strings. The first string corresponds to the full spelling of 
the name for the first month of the year (January), the second to the full spelling of the name for 
the second month of the year, and so on. 


d_t_fmt 


d_fmt 


Defines the string used for the standard date and time format corresponding to the %c field 
descriptor. The string can contain any combination of characters, field descriptors, or escape 
sequences. See Escape Sequences (see page L145) for additional information. 


Defines the string used for the standard date format corresponding to the %x field descriptor. The 
string can contain any combination of characters, field descriptors, or escape sequences. 
Following is an example of how the d_fmt keyword can be constructed: 


%D The %D indicates a %m/%d/“%y date format. If you are using this format and have chosen 
to set the job attribute from the locale, then a ’/ is extracted for the DATSEP job attribute. 
*MDY is extracted for the DATFMT job attribute. 


%j The %j indicates a Julian date format. If you are using this format and have chosen to set 
the job attribute from the locale, then no DATSEP job is extracted. However, *JUL is 
extracted for the DATFMT job attribute. 


%d-%m-Yy 
If you are using this format and have chosen to set the job attribute from the locale, then 
the compiler extracts - for the DATSEP job attribute and *DMY for the DATFMT job 
attribute. 


Py. %M.%od 
If you are using this format and have chosen to set the job attribute from the locale, then 
the compiler extracts . for the DATSEP job attribute and *YMD for the DATFMT job 
attribute. 


Ym/%d/%Y 
If you are using this format and have chosen to set the job attribute from the locale, then 
the compiler extracts / for the DATSEP job attribute. No DATFMT job attribute is extracted. 


Note: If the locale is to contain a valid OS/400 date format and date separator, then the d_fmt 
value must be defined such that it contains valid OS/400 date format and date separators. For 
example, if the value was specified as: %m/%d/%y, then *MDY would be extracted for the OS/400 
date format and a/ would be extracted for the OS/400 date format. A warning is issued by the 
CRTLOCALE command if an OS/400 date format or date separator cannot be extracted. 
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t_fmt 


Defines the string used for the standard time format corresponding to the %X field descriptor. The 
string can contain any combination of characters, field descriptors, or escape sequences. 
Following is an example of how the t_fmt keyword can be constructed: 


%H:%M:%S 
The compiler extracts a : (colon) for the TIMSEP job attribute. 


%H.%M.%S 
The compiler extracts a . (period) for the TIMSEP job attribute. 


%H %M %S 

The compiler extracts a blank space for the TIMSEP job attribute. 
%H,%M,%S 

The compiler extracts a , (comma) for the TIMSEP job attribute. 


%T %T implies a %H:%M:%S (hours, minutes, seconds) time format with a : (colon) as the 
TIMSEP job attribute. 


VHR%YME%S; 
A valid TIMSEP job attribute could not be determined. 


Note: If the locale is to contain a valid OS/400 time separator, then the t_fmt value must be 
defined such that it contains a valid OS/400 time separator. For example, if the value was 
specified as: %H:%M:%S, then a : (colon) would be extracted for the OS/400 date format. A 
warning is issued by the CRTLOCALE command if an OS/400 time separator cannot be extracted. 


am_pm 


Defines the strings used to represent ante meridiem (before noon) and post meridiem (after noon) 
corresponding to the %p field descriptor. Recognized values consist of two strings separated by a ; 
(semicolon). The first string corresponds to the ante meridiem designation, the last string to the 
post meridiem designation. 


t_fmt_ampm 


era 
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Defines the string used for the standard 12-hour time format that includes an am_pm value (%p 
field descriptor). This statement corresponds to the %r field descriptor. The string can contain any 
combination of characters and field descriptors. 


Defines how the years are counted and displayed for each era in a locale, corresponding to the 
%E field descriptor modifier. For each era, there must be one string in the following format: 


direction:offset:start_date:end_date:era_name:era_format 
The variables for the era-string format are defined as follows: 


direction 
Specifies a - (minus sign) or + (plus sign) character. The plus character indicates that 
years count in the positive direction when moving from the start date to the end date. The 
minus character indicates that years count in the negative direction when moving from the 
start date to the end date. 


offset Specifies a number representing the first year of the era. 


start_date 
Specifies the starting date of the era in the yyyy/mm/dd format, where yyyy, mm, and dd 
are the year, month, and day, respectively. Years prior to the year AD 1 are represented as 
negative numbers. For example, an era beginning March 5th in the year 100 BC would be 
represented as -100/03/05. 
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end_date 
Specifies the ending date of the era in the same form used for the start_date variable or 
one of the two special values -* or +*. A -* value indicates that the ending date of the era 
extends backward to the beginning of time. A +* value indicates that the ending date of the 
era extends forward to the end of time. Therefore, the ending date can be chronologically 
before or after the starting date of the era. For example, the strings for the Christian eras 
AD and BC would be entered as follows: 


+:0:0000/01/01:+*:AD:%0 %N 
+:1:-0001/12/31:-*:BC:%o %N 


era_name 
Specifies a string representing the name of the era that is substituted for the %EC field 
descriptor. 


era_format 
Specifies a string for formatting the %EY field descriptor. 


An era value consists of one string for each era. If more than one era is specified, each era string 
is separated by a ; (semicolon). 


era_d_fmt 
Defines the string used to represent the date in alternate-era format corresponding to the %Ex 
field descriptor. The string can contain any combination of characters and field descriptors. 


era_t_fmt 
Defines the string used to represent the time in alternate-era format corresponding to the %EX 
field descriptor. The string can contain any combination of characters and field descriptors. 


era_d_t_fmt 
Defines the string used to represent the date and time in alternate-era format corresponding to the 
%Ec field descriptor. The string can contain any combination of characters and field descriptors. 


alt_digits 
Defines alternate strings for digits corresponding to the %O field descriptor. Recognized values 
consist of a group of strings separated by ; (semicolons). The first string represents the alternate 
string for zero, the second string represents the alternate string for one, and so on. A maximum of 
100 alternate strings can be specified. 


Escape sequences 


The following are escape sequences allowed for the d_t_fmt, d_fmt, and t_fmt keyword values: 


> 

\ Represents the backslash character. 

\a Represents the alert character. 

\b Represents the backspace character. 

\f Represents the form-feed character. 

\n Represents the newline character. 

\r Represents the carriage-return character. 
\t Represents the tab character. 

\v Represents the vertical-tab character. 
Example: 


The following is an example of a LC_TIME category in a locale definition source file: 
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LC_TIME 


# 

#Abbreviated weekday names (%a) 

abday "<S><u><n>" 3 '"<M><o><n>"3'"<T><u><e>"3 "<W><e><d>";\ 
"<T><h><u>"3 "<F><r><i>"3'"'<S><a><t>" 

# 


#Full weekday names (%A) 

day "<S><u><n><d><a><y>" 3 "<M><o><n><d><a><y>";\ 
"<T><u><e><s><d><a><y>" 3 '"<W><e><d><n><e><s><d><a><y>"; \ 
"<T><h><u><r><s><d><a><y>" 5 '"<F><r><i><d><a><y>";\ 
"<S$><a><t><u><r><d><a><y>" 


# 

#Abbreviated month names (%b) 

abmon "<J><a><n>";'"<F><e><b>";"<M><a><r>"3"<A><p><r>";\ 
"<M><a><y>"3"<J><u><n>"53"<J><u><]1>"3"<A><u><g>";\ 
"<S><e><p>"3"<Q><c><t>"; '"'<N><o><v>"; "<D><e><c>" 

# 


#Full month names (%B) 

mon "<J><a><n><u><a><r><y>"3 '"<F><e><b><r><u><a><r><y>"; \ 
"<M><a><r><c><h>"; "<A><p><r><i><]>"5"<M><a><y>";\ 
"<J><u><n><e>" 3 '"<J><u><]><y>"; "<A><u><g><u><s><t>"; \ 
"<S><e><p><t><e><m><b><e><r>"; "<0><c><t><o><b><e><r>"; \ 
"<N><o><v><e><m><b><e><r>"; "<D><e><c><e><m><b><e><r>" 

# 

#Date and time format (%c) 

d_t_fmt "%a %b %d %H:%M:%S %Y" 


# 

#Date format (%x) 

d_fmt "Sm/%d/%y" 

# 

#Time format (%X) 

t_fmt "%H:%4M:%S" 

# 

#Equivalent of AM/PM (%p) 

am_pm "<f\><M>"; '"'<P><M>" 

# 

#12-hour time format (%r 

t_fmt_ampm "%I:%M:%S %p" 

# 

era "+:0:0000/01/01:+*:AD:%EC"; \ 
"+:1:-0001/12/31:-*:BC:%Ey"s 
era_d_fmt "" 

alt_digits "<Q><t><h>";"<1><s><t>"3"<2><n><d>"3"<3><r><d>";\ 


"<4><t><h>"3"<5><t><h>"3"<6><t><h>"3"<7><t><h>";\ 
"<8><t><h>"3'"'<9><t><h>";3 '"<1><0><t><h>" 

# 

END LC_TIME 


LC_TOD category: The LC_TOD category defines the rules used to define the start and end time of 
daylight savings time, the difference between local time and Greenwich Mean time, the time zone name, 
and the daylight savings time name. This category is an IBM extension and must appear after all other 
category definitions in the source file. 


All the operands for the LC_TOD category are defined as string or integer values. String values are 
bounded by double-quotation marks (""). All values are separated from the keyword they define by one or 
more spaces. Two adjacent double-quotation marks indicate an undefined string value. A 0 (zero) indicates 
an undefined integer value. The following keywords are recognized in the LC_TOD category. 


tzdiff Specifies an integer value representing the time zone difference in minutes. It is the difference 
between the local time and Greenwich mean time. 


tname Specifies the string used for the time zone name. 


dstname 
Specifies the string used for the daylight savings time name. 
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dststart 
Specifies a set of four integers representing the start date for the daylight savings time. The 
operand for the dststart keyword consists of a sequence of four comma-separated integers in the 
following format: 


month, week, day, time 
The variables for the dststart format are defined as: 


month 
Specifies an integer value representing the month of the year when Daylight Savings Time 
(DST) takes effect. This value ranges from 1 to 12, with 1 corresponding to January, and 
12 corresponding to December. 


week Specifies an integer value representing the week of the month when DST takes effect. 
This value ranges from -4 to 4, with -4 corresponding to the fourth week of the month 
counting from the end of the month and 4 corresponding to the fourth week of the month 
counting from the beginning of the month. 


day __ Specifies an integer value representing the day of the month when DST takes effect or if 
the week keyword is not 0 (zero), then this is the day of the week when DST takes effect. 
This value ranges from 1 to the last day of the month or 1 to the last day of the week. 


time Specifies an integer value representing the number of seconds after 12 midnight, local 
standard time, when DST takes effect. This value ranges from 0 to 86399. 


dstend 
Specifies a set of four integers representing the end date for the daylight savings time. The 
operand for the dstend keyword consists of a sequence of four comma-separated integers in the 
following format: 


month, week, day, time 
The variables for the dstend format are defined as: 


month 
Specifies an integer value representing the month of the year when Daylight Savings Time 
(DST) ends. This value ranges from 1 to 12, with 1 corresponding to January, and 12 
corresponding to December. 


week Specifies an integer value representing the week of the month when DST ends. This value 
ranges from -4 to 4, with -4 corresponding to the fourth week of the month counting from 
the end of the month and 4 corresponding to the fourth week of the month counting from 
the beginning of the month. 


day __ Specifies an integer value representing the day of the month when DST ends or if the 
week keyword is not 0 (zero), then this is the day of the week when DST ends. This value 
ranges from 1 to the last day of the month or 1 to the last day of the week. 


time Specifies an integer value representing the number of seconds after 12 midnight, local 
standard time, when DST takes effect. This value ranges from 0 to 86399. 


dstshift 

Specifies an integer value representing the daylight savings time shift in seconds. 
Example: 
The following is an example of a LC_TOD category in a locale definition source file: 


a 
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LC_TOD 


# 
tzdiff -360 
tname "<C><e><n><t><r><a><] >" 


dstname "<C><D><T>" 


#Set daylight savings time to start on 3rd week of October at 
#midnight on Saturday. 
dststart 10,3,6,0 


#Set daylight savings time to end on April 23, at midnight. 


dstend 4,0,23,0 
dstshift 3600 

# 

END LC_TOD 


4 


Locale symbolic names 

OS/400 supports locale symbolic names based on predefined names from the X/Open Standard lportabld 
bharacter rf In addition, OS/400 supports a 5-character alphanumeric symbolic name for all characters, 
where: 


¢ The first character of the symbolic name is a Latin capital letter U. This character identifies that the 
name is derived from the ISO/IEC 10646 Universal Coded Character Set. 


* The second through fifth characters of the symbolic name represent the code point of the character in 
the ISO/IEC 10646 Universal Coded Character Set 2 Level 1. This portion of the symbolic name is 
assigned by code point for ease of creating and changing locales. 


As an example, the question mark (?) character provides the following correlation between symbolic 
naming, UCS2-1 code point, and an IBM-assigned code point: 


* The ? character is symbolically represented by <question-mark> 
¢ It is at code point U003F in the ISO 10646 code page 
° It is at code point 6F in IBM code page 500. 


Mapping of locale symbolic names provides a list of all symbolic names supported on OS/400. The table 
also provides the UCS2-1 (ISO 10646) code points, their corresponding IBM code page or code points, 
and a graphic representation of each character. 


Examples: Locale programming 
In addition to the following examples, Locale categories provides programming examples for each of the 
different locale categories: 


* Example: How locales work 

* Example: Creating locales 

* Example: Producing unique monetary formats 

* Example: Locales as part of a multilingual environment 
* Example: POSIX locale 

* Example: EN US locale 


Example: How locales work: Following are two examples that focus on the LOCALE and SETJOBATR 
parameter values specified on the user profile. 


The first example illustrates using locales to establish job attributes. The user profile parameters LOCALE 


and SETJOBATR have values of *SYSVAL. This means that the job attributes at job start up time come 
from the QLOCALE value based on the values in QSETJOBATR. 
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Job attributes (from user profile) 


- CCSID = From locale XYZ 
- TIMSEP = Fromlocale XYZ 
- DATFMT = Fromlocale XYZ 
- DATSEP = Fromlocale XYZ 
- SRTSEQ = Fromlocale XYZ 


Environment variable 
- LANG = /QSYS.LIB/MYLIB.LIB/ 
AYZ.LOCALE 


User profile parameters 
- LOCALE = *SYSVAL 
- SETJOBATR = ‘“SYSVAL 


system values 
- QLOCALE = /QSYS.LIB/MYLIB. 
LIB./XYZ.LOCALE 

- QSETJOBATR = *CCSID, 
*DATFMT, 
*DATSEP, 
*TIMSEP, 
*SRTISEQ 


- QCCSID = 00037 
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If a job ran based on the information in the figure, the following would be true: 
¢ The locale used would be XYZ. 


This is because the user profile parameter value for LOCALE was *SYSVAL. The *SYSVAL value is 
XYZ. 


¢ The CCSID would be based on the value specified when locale XYZ was created. 
This value is specified when the LOCALE object is created using the CRTLOCALE command. 
¢ The time separator would be derived from locale XYZ. 
This value is derived from the LC_TIME category specified in LOCALE XYZ. 
* The date format separator would be derived from locale XYZ. 
This value is derived from the LC_TIME category specified in LOCALE XYZ. 
¢ The data separator would be derived from locale XYZ. 
This value is derived from the LC_TIME category specified in LOCALE XYZ. 
¢ The decimal format character would be derived from locale XYZ. 
This value is derived from the LC_NUMERIC category specified in LOCALE XYZ. 


In the second example the user profile LOCALE parameter value is *SYSVAL and the SETJOBATR 
parameter values is *NONE. This means that the LOCALE value is determined by looking at the system 
value QLOCALE. When the SETJOBATR value is *NONE, job attributes are determined by the values in 
the user profile. 


Remember, because the user profile SETJOBATR parameter was *NONE, the system’s search resulted in 
using the values specified for QCCSID, QTIMSEP, QDATFMT, QSRTSEQ, and QDATSEP. 
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Job attributes (from user profile) 


- CCSID 

- TIMSEP 
- DATFMT 
- DATSEP 
- SRTSEQ 


Environment variable 
- LANG = /QSYS.LIB/MYLIB.LIB/ 
XYZ.LOCALE 


From QCCSID 

From QTIMSEP 
From QDATFMT 
From QDATSEP 
From QSRTSEQ 


User profile parameters 
- LOCALE 
- SETJOBATR 
- CCSID 
- SRTSEQ 


system values 


“SYSVAL 
*NONE 

“SYSVAL 
*SYSVAL 


- QLOCALE = xyz 


- QSETJOBATR = 


- QCCSID 


- QTIMSEP 


*CCSID, 

*“DATFMT, 
*DATSEP, 
*TIMSEP, 
*SRTSEQ 


00037 
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If a job ran based on the information in this example, the following would be true: 
* The locale used would be XYZ. 


This is because the user profile parameter value for LOCALE was *SYSVAL. The *SYSVAL value is 
XYZ. 


¢ The CCSID is 00037. 


This is because the user profile SETJOBATR parameter value was *NONE. The system search ended 
with the value for QCCSID being used. 


¢ The time separator is a colon (:). 


This is because the user profile SETJOBATR parameter value was *NONE. The system search ended 
with the value for QTIMSEP being used. 


* The date format separator is a slash (/). 


This is because the user profile SETJOBATR parameter value was *NONE. The system search ended 
with the value for QDATSEP being used. 


¢ The date format is month/day/year (MDY). 


This is because the user profile SETJOBATR parameter value was *NONE. The system search ended 
with the value for QDATFMT being used. 


¢ The decimal format character is a period. Zero suppression is performed. 


This is because the user profile SETJOBATR parameter value was *NONE. The system search ended 
with the value for QDECFMT being used. 


Example: Creating a locale: This example contains the steps necessary for creating a locale. The 
example also shows how to enable the locale. The steps are: 


1. Create (or have ) a library and a source physical file. 
2. Copy an existing locale source file definition member (to a library and source physical file). 


3. Edit the copied locale source file member if you need to customize any of the categories within the 
locale source. 


4. Create the locale object. 
5. Enable the locale object by using system values or parameters on the user profile. 


Step 1. Create a library and source physical file 


The library and source physical file are needed to store the locale source file member. See 
e for a list of the locale source file members that are shipped 


LD ( 
with OS/400. 
1. Type CRTLIB and press the F4 (prompt) key. 

2. Type localelib for the name of the library and press the Enter key. 


There is now a library called localelib. 


Next, create a source physical file. 
1. Type CRTSRCPF and press the F4 (prompt) key. 
2. Type localesrc for the file name and press the Enter key. 


There is now a source physical file (localesrc) created in library localelib. 


Step 2. Copy an existing locale source definition 
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IBM-supplied locale source definition file member are located in library QGYSLOCALE, source physical 
file QLOCALESRC. See ( j for a list of all IBM-supplied locale 
source files. In this example we wil copy member EN Us. a locale for the English language. 


1. Type CPYF and press the F4 (prompt) key. 
2. Type the values shown on the following display. 


|Copy File (CPYF) 


lei choices, press Enter. 


| 
| 
POM TTS: og. fee ed oh Se sew QLOCALESRC Name 
ETD RAY: * <6 aie G24. a a a QSYSLOCALE Name, *LIBL, *CURLIB 
TO. TING as vi Shwe He Rwy a eS LOCALESRC_ Name, *PRINT 
LADMARY” ssid: ist-e Bovey Bede ww & LOCALELIB_ Name, *LIBL, *CURLIB 
From member .........-. EN_US Name, generic*, *FIRST, *ALL | 
To member or label ....... EN_US Name, *FIRST, *FROMMBR 
[Replace or add records. .... *ADD_ *NONE, *ADD, *REPLACE 
Create file « aw i sk wa ws *YES *NO, *YES 
Record format field mapping .. *MAP__ *NONE, *NOCHK, *CVTSRC 
+------------------------ ~~ ee ot 


The values entered copy the EN_US member to the source physical file localesrc in library localelib. 


Note: When you copy a file that is tagged with a CCSID, you need to use the FMTOPT(*MAP) parameter 
to ensure that the copied source is converted to the CCSID of the “to file”. The FMTOPT parameter can be 
seen by scrolling ahead. 


Step 3. Edit the copied locale source definition 


If you want to use the IBM-supplied locale as it is shipped, you do not need to change it. You can go to 
the next step, create the locale object. However, in this example we will edit the EN_US member to set the 
time-of-day keywords used in the LC_TOD category. 


Note: The LC_TOD category is shipped with the keywords having no values. See the LC_TOD Category 
to view the LC_TOD source as shipped by IBM. 


In this example, we are using Source Entry Utility (SEU) to edit the locale. You can use SEU or an 
equivalent editor. 


1. Type STRSEU (Start Source Entry Utility) and press the F4 (prompt) key. 


2. Type the source file name (localesrc), library name (localelib), and source member name (EN_US) as 
shown on the following display. 


|Start Source Entry Utility (STRSEU) 
lord choices, press Enter. 


| 
sone FILE. gms a wae ew ee localesrc Name, *PRV 

| 

| 


| Library ei Ride itch Mah eS Ga ae localelib_ Name, *LIBL, *CURLIB, *PRV 
|Source member ......... EN_US Name, *PRV, *SELECT 
+---------------- 6 ee + 


Hen bee Chetaee ese ek eae oats ese eee Sees senor se eee aeeeegoeter nee esse seseseee= + 
|COLUMNS . . . : 1 71 EDIT LOCALELIB/LOCALESRC | 
SEU==> F_LC_TOD EN_US 
FMT 4% sicattioes 1 satis 2 sashes J attiae © eaetvae 5 sects O siete 7 


[Fone eats dik BEGINNING OF DATA 2X *%*# x RRA KEK K KAA KEE KKK EKER ERE KERERE | 
Eee comment_char <percent-sign> | 
5968.00 escape_char <slash> 

[5969.00 | 
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5971.00 % 5716SS1 (C) COPYRIGHT IBM CORP. 1991,1996 | 
[5972.00 % ALL RIGHTS RESERVED. | 
Fa % US GOVERNMENT USERS RESTRICTED RIGHTS - | 
5974.00 % USE, DUPLICATION OR DISCLOSURE RESTRICTED 
ee 00 % BY GSA ADP SCHEDULE CONTRACT WITH IBM CORP. | 
5976.00 % 
a % LICENSED MATERIALS-PROPERTY OF IBM | 
5978.00 % 
|5979.00 % FILE NAME : EN_US | 
ar 00 % | 
5981.00 % COUNTRY/REGION: UNITED STATES 
a % | 
cee F4=PROMPT F5=REFRESH F9=RETRIEVE FI10=CURSOR F11=TOGGLE 
F16=REPEAT FIND F17=REPEAT CHANGE F24=MORE KEYS 
| (C) COPYRIGHT IBM CORP. 1981, 1996. | 
+----------------- = 5 a + 


4. Use the SEU search function to locate LC_TOD. After the search completes, the display below 
appears. 
As you can see, all LC_TOD category keywords have values of 0 and no descriptive names declared 
for thame and dstname. 


6520.00 LC_TOD 


| 
6522.00 tzdiff 0 | 
6523.00 tname me 
6524.00 dstname "" | 
6525.00 dststart 0,0,0,0 | 
6526.00 dstend 0,0,0,0 | 
6527.00 dstshift 0 | 
| 
| 


6529.00 END LC_TOD 


[ Sees ie eda ek ae eae END OF DATA *&X XXX AKA KRKKEKKKKR KEKE RK EK ERE KK REKERREERERE 


|F3=EXIT F4=PROMPT F5=REFRESH F9=RETRIEVE F10=CURSOR F11=TOGGLE 
F16=REPEAT FIND F17=REPEAT CHANGE F24=MORE KEYS 
STRING LC_TOD FOUND. 


5. Type the following values for the LC_TOD keywords. If you need more detailed information about this 
category, see Ec Top Categonl 


tzdiff Time zone difference in minutes. Type -360. This is the difference in the number of minutes 
between Greenwich mean time and the central time zone of the United States. 


tname Specifies the string used for the time zone name. Type “<C><E><N><T><R><A><L>”. 


dstname 
The string used for the daylight savings time name. Type “<C><D><T>” to mean Central 
Daylight Time. 


dststart 
Four integers representing the start date for daylight savings time. Type 4,0,23,0. This string of 
integers means that daylight savings time starts the fourth month of the year, the twenty-third 
day of the month, and 0 seconds after midnight local standard time. 
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dstend 


The SEU edit screen should match the screen below. 
Press the Enter key to make the changes to the EN_US locale member. 


+------------------------------ - ~~ - + 
|COLUMNS . 1 71 EDIT LOCALELIB/LOCALESRC 

SEU==> EN_US 

EMT * cacthce LD wncPeea 2 anctoae 3) was | 
|6519.00 | 
6520.00 LC_TOD | 
6521.00 

6522.00 tzdiff  -360 

6523.00 tname "<C><E><N><T><R><A><L>" | 
6524.00 dstname "<C><D><T>" | 
6525.00 dststart 4,0,23,0 

[6526.00 dstend 10,3,6,0 
6527.00 dstshift 3600 

6528.00 

6529.00 END LC_TOD 


F3=EXIT 
|F16=REPEAT FIND 


KKKKKKKEKKKKEKKKEK END OF DATA 2x4 xXx RK KK EKER KK KKK EK EKER EK ERE RK EKER 


F4=PROMPT  F5=REFRESH F9=RETRIEVE 


F17=REPEAT CHANGE 


|STRING LC_TOD FOUND. 


7. Save the member and exit SEU. 


You have now copied the IBM-supplied locale source member and customized the LC_TOD category. 


Step 4. Create the locale object 


Type CRTLOCALE on any command line and press the F4 (prompt) key. The Create Locale 
(CRTLOCALE) screen appears. 


1. 


2. 


Type the locale path name (path name includes the location of as well as the name of the locale 


source member) in the locale name field. 


Four integers representing the end date for daylight savings time. Type 10,3,6,0. This string of 
integers means that daylight savings time ends the tenth month of the year, the beginning of 
the third week, the sixth day of the week, and 0 seconds after midnight local standard time. 


dstshift 
An integer value representing the daylight savings time shift in seconds. Type 3600. 


Next, type the source file path name (location and name of the source physical file and the member 
name) in the source file path name field. 


Notes: 


a. Make sure that the locale source file you are using has the same characters defined as does the 
CCSID you specify on the CRTLOCALE command. If they do not have the same characters 


available, unpredictable results can occur. For example, the EN_US source file has lower case 


letters defined. However, the lower case letters are not available in CCSID 290. 


b. When creating locale objects, it is recommended to make the CCSID part of the locale object 


name. For example, EN_US created with CCSID 37 could be named EN_US37. 


|Create Locale (CRTLOCALE) 


Type choices, press Enter. 
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pee mame . www we we ee > '/qsys.lib/localelib.lib/en_us.locale' 


[Source file path name ..... > '/qsys.lib/localelib.lib/localesrc.file/en | 
_us.mbr' 

Coded character set ID..... > 37 1-65533, *JOB 

Generation severity level ... 10 10, 20 

Text 'description’....... my version of locale EN_US - contains my c 
hanges__ | 
| | 
Fee | 
|F3=Exit F4=Prompt F5=Refresh F10=Additional parameters F12=Cancel | 
tai to use this display F24=More keys | 
| | 
+-------------------------- = - + 


4. Press the Enter key to complete the creation of the locale object named EN_US in the library 
LOCALELIB 


Step 5. Enable the locale object 


Locales can be enabled system-wide using the QLOCALE system value or for individual users by 
changing their user profile. To enable system-wide, make EN_US the value for QLOCALE. In this example 
we will enable locale support for one user. 

1. Type CHGUSRPRF and press the F4 (prompt) key. 

2. Specify your userid and then press the Enter key. 


In the portion of the Change User Profile display shown below, the LOCALE parameter now has a 
value indicating that EN_US is the specified locale to be used by your userid. 


+ for more values 
LOGAN: a: ase vias ee de ew ee ne QSYS.LIB/LOCALELIB.LIB/EN_US.LOCALE 


After your user profile has been changed, any jobs initiated by your userid have the EN_US locale 
associated with those jobs. The LANG environment variable is also initialized to the name of the locale. 


Example: Producing unique monetary formats: A unique customized monetary format can be 
produced by changing the value of a single statement. For example, the following table shows the results 
of using all combinations of defined values for the p_cs_precedes, p_sep_by_space, and p_sign_posn 
statements: 


p_cs_precedes value |p_sign_posn value |p_sep_by_space=2 |p_sep_by_space=3 | p_sep_by_space=4 
p_cs_precedes = 1 p_sign_posn = 0 ($1.25) ($ 1.25) ($1.25) 
p_sign_posn = 1 + $1.25 +$ 1.25 +$1.25 
p_sign_posn = 2 $1.25 + $ 1.25+ $1.25+ 
p_sign_posn = 3 + $1.25 +$ 1.25 +$1.25 
p_sign_posn = 4 $ 4+1.25 $+ 1.25 $41.25 


156 iSeries: Globalization (Handle data in global applications) 


p_cs_precedes value |p_sign_posn value |p_sep_by_space=2 |p_sep_by_space=3 | p_sep_by_space=4 
p_cs_precedes = 0 p_sign_posn = 0 (1.25 $) (1.25 $) (1.25$) 

p_sign_posn = 1 +1.25 $ +1.25 $ +1.25$ 

p_sign_posn = 2 1.25$ + 1.25 $+ 1.25$+4 

p_sign_posn = 3 1.254+ $ 1.25 +$ 1.254+$ 

p_sign_posn = 4 1.25$ + 1.25 $+ 1.25$+ 


Example: Locales as part of a multilingual environment: OS/400, through the use of locales, user 
profiles, and subsystems, can provide a multilingual environment. Users of a system setup for multilingual 
environments work with their national language and all its cultural conventions (for example, the character 
used to separate hours, minutes, and seconds). 


Assume the system used in this example has its primary language defined as English and the secondary 
national language versions (NLVs) for French and Spanish have been installed. 


Follow the steps in this example to: 

* Create the locales for English, French, and Spanish 

* Create user profiles for users named: English, French, and Spanish 

* Create separate subsystems for French and Spanish language users. 


Step 1. Create locales 
1. Type CRTLOCALE and press the Prompt key (F4). 
2. Enter the following values for the fields listed below: 
¢ Locale name: qsys.lib/localelib.lib/en_us.locale 
* Source file path name: qsys.lib/qsyslocale.lib/qlocalesrc.file/en_us.mbr 
* Coded character set ID: 37 
* Generation severity level: 20 
* Text ‘description’: US English locale 
3. Press Enter. 


Repeat the CRTLOCALE command for the FRENCH and SPANISH locales, using the following values for 
the fields listed below. 


For the French locale: 

* Locale name: qsys.lib/localelib.lib/fr_fr.locale 

* Source file path name: qsys.lib/qsyslocale.lib/qlocalesrc.file/fr_fr:mbr 
* Coded character set ID: 297 

* Generation severity level: 20 

* Text description’: French locale 


For the Spanish locale: 

¢ Locale name: qsys.lib/localelib.lib/es_es.locale 

¢ Source file path name: qsys.lib/qsyslocale.lib/qlocalesrc.file/es_es.mbr 
* Coded character set ID: 284 

* Generation severity level: 20 

* Text ‘description’: Spanish locale 
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You have created three locales (EN_US (English US), FR_FR (French), and ES_ES (Spanish). They are 
stored in library localelib.lib. 


Step 2. Create the user profile 


In this example three user profiles are created; each one will use one of locales we just created. The user 
profile names are: English, French, and Spanish. 


1. Type CRTUSRPRF and prompt (F4). 
2. Type ENGLISH for the User profile parameter value 
3. Scroll forward until you see the Locale job attributes parameter and the Locale parameter. 
4. Type: 
¢ /qsys.lib/localelib.lib/en_us.locale for the Locale parameter value. 
* Type + for the Locale job attributes parameter value and press Enter. Type: 


*CCSID 
*DATFMT 
*DATSEP 
*TIMSEP 
*“SRTSEQ 
*DECFMT 


Note: At job start up the system finds the actual job attribute values defined in the locale object. The 
job attributes found in the locale override the values specified in the user profile fields for the CCSID 
and SRTSEQ parameters. They also override the Date Format, Date Separator, and Time Separator 
job attributes specified in any system value. 


5. Press Enter. You have now created the user profile for a user named ENGLISH. 


Repeat the CRTUSRPRF command for user IDs FRENCH and SPANISH. The next two displays provide 
the correct Locale parameter and Locale job attribute information for creating the user profiles for 
FRENCH and SPANISH. 


Locale job attributes ..... > xCCSID *SYSVAL, *NONE, *CCSID... 

> *DATFMT 

> *DATSEP 

> *TIMSEP | 
| 
| 


|> *SRTSEQ 
[locale . see ac ee eae ee > '/qsys.lib/localelib.lib/fr_fr.locale' 


Locale job attributes ..... > *CCSID *SYSVAL, *NONE, *CCSID... 

> *DATFMT 

[> *DATSEP | 

> *TIMSEP | 
| 


> *SRTSEQ 
LOCATE is. sim we Te we Gee ew > '/qsys.lib/localelib.lib/es_es.locale' 


Step 3. Creating subsystems for each national language version 


Subsystems can be tailored to provide users an environment in which they see their own national 
language with data presented in the cultural format and conventions they are used to seeing. 


Note: Since the primary language of the system is English, we do not need to create a subsystem for 
English. 
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1. Type CRTSBSD and prompt (F4). 


2. Specify values for the following parameters to ensure that the subsystem is enabled for a specific 
national language (such as French and Spanish in our example). 


* Subsystem description 
This can be any name you choose. 
* Text ’description’ 
The description can be anything you want it to be. 
* Sign-on display file and Library 
This often is QDSIGNON. The important information here is to know the name of the library where 
the national language version (French in this example) is stored. 
* Subsystem library 


Specifies a library that is entered ahead of other libraries in the library list of jobs started in this 
subsystem. This parameter allows you to use a secondary language library causing messages and 
displays to appear in your spoken language. 


Note: The correct values for Sign-on display file library and Subsystem library parameters are 
determined by adding QSYS to the national language version feature code. For example: the French 
national language library is named QSYS2928. 


odes for a listing of all supported language versions. 


The screen below shows the correct values to ensure that users of the FRENCH subsystem interact 
with the computer in the French language. 


Create Subsystem Description (CRTSBSD) 


oe choices, press Enter. | 


Subsystem description ..... SBSD > FRENCH 

LIDMANY cee ee de He Ben oe *CURLIB 
+----------------------- ~~ a a 
+----------------------- ~~ + 
Text 'description’....... TEXT > 'Subsystem for French users' 


ae Parameters 


Sign-on display file. ..... SGNDSPF > QDSIGNON 
LADRARY <a “esx: cg 2: 82 Ge Bee > QSYS2928 
Subsystem library ....... SYSLIBLE > QSYS2928 
More... 


| F3=Exit F4=Prompt F5=Refresh F12=Cancel F13=How to use this display 
pee keys 


3. Press Enter. 
Step 4. More information about subsystems 


Creation of a subsystem requires additional work such as: 
* Setting subsystem attributes 

¢ Adding workstation entires 

¢ Adding job queue entries 


¢ Adding communications entires (if your national language users are attached over communications 
lines) 
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¢ Adding autostart job entires if you want to use this feature 
¢ Adding prestart job entires if you want to use this feature 
* Creating a class 

¢ Adding routing entries 


How to perform the tasks in the list above is not described in this example. For more information about 
Subsystems, see the Work Managemeni topic in the Information Center. 


Example: POSIX locale: The POSIX (or C) locale follows. It is published in its entirety because: 
* It provides a locale example with source provided for all categories. 
* If you have not set a locale value in your C application program, the default POSIX locale is then used. 


In either case, in the listing below, you are able to look at the locale categories and view the source. 


comment_char <percent-sign> 
escape_char <slash> 


5716SS1 (C) COPYRIGHT IBM CORP. 1991,1996 
ALL RIGHTS RESERVED. 

US GOVERNMENT USERS RESTRICTED RIGHTS - 
USE, DUPLICATION OR DISCLOSURE RESTRICTED 
BY GSA ADP SCHEDULE CONTRACT WITH IBM CORP. 
LICENSED MATERIALS-PROPERTY OF IBM 

FILE NAME : POSIX 

COUNTRY/REGION: POSIX DEFAULT LOCALE 
LANGUAGES(S): NOT SPECIFIED 


DESCRIPTION: LOCALE SOURCE DEFINITION FILE. 


d© d© de dO BE BO BE BE DE DO DE AS DO BO AO OO SO 


LC_CTYPE 


upper <A>;<B>;<C>;<D>;<E>;<F>3<G>3;<H>;<I>3;<J>3;<K>3;<L>;<M>;/ 
<N>3<0>;<P>3<Q>;<R>;<S>3<T>3;<U>;<V>;<W>3;<X>;<Y>;<Z> 


lower <a>; <b>;<c>;<d>;<e>;<f>;<g>3<h>;<i>3<j>;<k>3<1>;<m>;/ 
<n>3;<0>;<p>3;<q>3<r>3;<s>3<t>;<u>;<v>3<w>;<x>3<y>3<z> 


Space <tab>;<newl ine>;<vertical -tab>;<form-feed>;<carriage-return>; / 
<space> 


cntrl <NUL>;<SOH>;<STX>;<ETX>;<EOT>;<ENQ>;<ACK>;<alert>;<backspace>; / 
<tab>;<newl ine>;<vertical-tab>;<form-feed>;<carriage-return>; / 
<S0>;<SI>;<DLE>;<DC1>;<DC2>;<DC3>;<DC4>;<NAK>;<SYN>;<ETB>;/ 

<CAN>; <EM>; <SUB>;<ESC>;<IS4>;<IS3>;<IS2>;<IS1>;<DEL> 


punct <exclamation-mark>;<quotation-mark>;<number-sign>;/ 
<dollar-sign>;<percent-sign>;<ampersand>;<apostrophe>; / 
<left-parenthesis>;<right-parenthesis>;<asterisk>;<plus-sign>;/ 
<comma>;<hyphen>;<period>;<slash>;/ 
<colon>;<semicolon>;<less-than-sign>;/ 
<equals-sign>;<greater-than-sign>;<question-mark>; / 
<commercial-at>;/ 

<left-square-bracket>;<backslash>; / 
<right-square-bracket>;<circumflex>; / 
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<underscore>;<grave-accent>; / 
<left-curly-bracket>;<vertical-line>;<right-curly-bracket>;/ 


<tilde> 


digit <zero>;<one>;<two>;<three>;<four>;/ 
<five>;<six>;<seven>;<eight>;<nine> 


xdigit <zero>;<one>;<two>;<three>;<four>;/ 
<five>;<six>;<seven>;<eight>;<nine>;/ 
<A>;<B>;<C>;<D>;<E>;<F>;/ 
<a>;<b>;<c>;<d>;<e>;<f> 


blank <space>;/ 
<tab> 


toupper (<a>,<A>) ; (<b>,<B>) ; (<c>,<C>) ; (<d>,<D>) ; (<e>,<E>);/ 
(<f>,<F>) ; (<g>,<G>) ; (<h>,<H>) ; (<i>,<I>) 3 (<j>,<J>);/ 
(<k>,<K>) ; (<1>,<L>) ; (<m>,<M>) ; (<n>,<N>) ; (<o>,<0>) ;/ 
(<p>,<P>) ; (<q>,<Q>) ; (<r>,<R>) ; (<s>,<S>) 3 (<t>,<T>)3/ 
(<u>, <U>) ; (<v>,<V>) 3 (<w>,<W>) 3 (<x>,<X>) 3 (<y>,<Y>)3/ 
( 


<z>,<Z>) 


tolower (<A>,<a>) ; (<B>,<b>) ; (<C>,<c>) ; (<D>,<d>) ; (<E>,<e>) ;/ 
(<F>,<f>) ; (<G>,<g>) ; (<H>,<h>) ; (<I>,<i>) ; (<J>,<j>)3/ 
(<K>,<k>) ; (<L>,<1>) ; (<M>,<m>) ; (<N>,<n>) ; (<0>,<o>) ;/ 
(<P>,<p>) ; (<Q>,<q>) ; (<R>,<r>) ; (<S>,<s>) ; (<T>,<t>) ;/ 
(<U>, <u>) ; (<V>,<v>) 3 (<W>,<w>) 3 (<X>,<x>) 3 (<Y>,<y>)3/ 
( 


END LC_CTYPE 
LC_COLLATE 
order_start 


<NUL> 

<SOH> 

<STX> 

<ETX> 

<E0OT> 

<ENQ> 

<ACK> 
<alert> 
<backspace> 
<tab> 

<newl ine> 
<vertical-tab> 
<form-feed> 
<carriage-return> 
<S0> 

<SI> 

<DLE> 

<DC1> 

<DC2> 

<DC3> 

<DC4> 

<NAK> 

<SYN> 

<ETB> 

<CAN> 

<EM> 

<SUB> 

<ESC> 

<IS4> 

<IS3> 

<IS2> 

<IS1> 
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<space> 
<exclamation-mark> 
<quotation-mark> 
<number-sign> 
<dollar-sign> 
<percent-sign> 
<ampersand> 
<apostrophe> 
<left-parenthesis> 
<right-parenthesis> 
<asterisk> 
<plus-sign> 
<comma> 

<hyphen> 
<period> 

<slash> 

<zero> 

<one> 

<two> 

<three> 

<four> 

<five> 

<six> 

<seven> 

<eight> 

<nine> 

<colon> 
<semicolon> 
<less-than-sign> 
<equals-sign> 
<greater-than-sign> 
<question-mark> 
<commercial-at> 
<A> 

<B> 

<C> 

<D> 

<E> 

<F> 

<G> 

<H> 

<[> 

<J> 

<K> 

<L> 

<M> 

<N> 

<0> 

<P> 

<Q> 

<R> 

<S> 

<T> 

<U> 

<\> 

<W> 

<X> 

<Y> 

<7> 
<left-square-bracket> 
<backslash> 
<right-square-bracket> 
<circumf1lex> 
<underscore> 
<grave-accent> 
<a> 

<b> 
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<c> 
<d> 

<e> 

<f> 

<g> 

<h> 

<i> 

<j> 

<k> 

<|> 

<m> 

<n> 

<o> 

<p> 

<q> 

<> 

<s> 

<t> 

<u> 

<y> 

<w> 

<xX> 

<y> 

<z> 
<left-curly-bracket> 
<vertical-line> 
<right-curly-bracket> 
<tilde> 

<DEL> 

UNDEFINED 


order_end 
END LC_COLLATE 
LC_MONETARY 


int_curr_symbol "" 
currency_symbol "" 
mon_decimal_point "" 
mon_thousands_sep "" 
mon_grouping -1 
positive_sign "" 
negative_sign 
int_frac_digits -1 
frac_digits -1 
p_cs_precedes -1 
p_sep_by_space -1l 
n_cs_precedes -1 
n_sep_by_space -1 
p_sign_posn -1 
n_sign_posn -1 


END LC_MONETARY 
LC_NUMERIC 


decimal_point "<period>" thousands_sep 
"" grouping -1 


END LC_NUMERIC 
LC_TIME 
abday "<S><u><n>";/ 


"<M><o><n>"5 / 
"<T><u><e>"5/ 
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"<W><e><d>"; / 
"<T><h><u>";/ 
"<Fo<p><i>"3/ 
"<S><qa><t>" 


day "<S><u><n><d><a><y>";/ 
"<M><o><n><d><a><y>"; / 
"<T><u><e><s><d><a><y>"3 / 
"<W><e><d><n><e><s><d><a><y>";/ 
"<T><h><u><r><s><d><a><y>"3/ 
"<Fo<p><j><d><a><y>"3 / 
"<S><a><t><u><r><d><a><y>" 


abmon "<J><a><n>";/ 
"<F><e><b>";/ 
"<M><a><r>"; / 
"<A><p><r>"; / 
"<M><a><y>"'; / 
"<J><u><n>"3/ 
"<J><u><]>"3/ 
"<A><u><g>";/ 
"<S><e><p>";/ 
"<Q><c><t>";/ 
"<N><o><v>"; / 
"<D><e><c>" 


mon "<J><a><n><u><a><r><y>"; / 
"<F><e><b><r><u><a><r><y>"3 / 
"<M><a><r><c><h>"; / 
"<A><p><r><i><1>"3/ 
"<M><a><y>"3 / 
"<J><u><n><e>"; / 
"<J><u><]><y>"3/ 
"<A><u><g><u><s><t>"; / 
"<$><e><p><t><e><m><b><e><r>"; / 
"<Q><c><t><o><b><e><r>"3/ 
"<N><o><v><e><m><b><e><r>"; / 
"<D><e><c><e><m><b><e><r>" 


d_t_fmt "%a %b %d %H:%M:%S %Z %Y" 
d_fmt "%m//%d//%y" 

t_fmt "%H:%M:%S" 

am_pm "<A><M>"; "<P><M>" 
t_fmt_ampm "%1:%M:%S %p" 
END LC_TIME 

LC_MESSAGES 

yesexpr "[yY] [eE] [sS] | [yY]" 
noexpr "[nN] [00] | [nN]" 
yesstr "yes" 

nostr "no" 

END LC_MESSAGES 

LC_TOD 


tzdiff 0 
tname "" 
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dstname "" 
dststart 0,0,0 
dstend 0,0,0,0 
dstshift 0 


0 


END LC_TOD 


Example: EN_US locale: The EN-US locale follows. In the example below, you can look at the locale 


categories and view the source. 


comment_char <percent-sign> 
escape_char <slash> 


5716SS1 (C) COPYRIGHT IBM CORP. 1991,1996 
ALL RIGHTS RESERVED. 

US GOVERNMENT USERS RESTRICTED RIGHTS - 

USE, DUPLICATION OR DISCLOSURE RESTRICTED 

BY GSA ADP SCHEDULE CONTRACT WITH IBM CORP. 
LICENSED MATERIALS-PROPERTY OF IBM 

FILE NAME : EN_US 

COUNTRY/REGION: UNITED STATES 

LANGUAGES(S): ENGLISH 


DESCRIPTION: LOCALE SOURCE DEFINITION FILE. 


d© d©& do dE AS BO BO BE BO DE BE DE BO DE BO DO BO 


LC_CTYPE 


upper <A>3;<B>;<C>;<D>;<E>;<F>3;<G>3;<H>;<I>3;<J>;<K>;<L>;<M>;/ 
<N>3;<O>;<P>3;<Q>;<R>;<S>3<T>;<U>;<V>3<W>3<X>3<Y>3<Z>;/ 
<A-acute>;<A-grave>;<A-circumflex>;<A-diaresis>; / 
<A-tilde>;<A-ring>;<AE>;<C-cedilla>;<Eth>;<E-acute>; / 
<E-grave>;<E-circumflex>;<E-diaresis>;<I-acute>; / 
<I-grave>;<I-circumflex>;<I-diaresis>;<N-tilde>;/ 
<0-acute>;<0-grave>;<0-circumflex>;<0-diaresis>;/ 
<0-tilde>;<0-slash>;<Thorn>;<U-acute>;<U-grave>; / 
<U-circumflex>;<U-diaresis>;<Y-acute> 


lower <a>; <b>3;<c>;<d>;<e>;<f>;<g>3<h>3<i>3<j>;<k>3<1>;<m>;/ 
<n>3;<0>;<p>;<q>3<r>3<S>3<t>3<u>3<v>3<w>3<x>3;<y>3;<z>3/ 
<a-acute>;<a-grave>;<a-circumflex>;<a-diaresis>; / 
<a-tilde>;<a-ring>;<ae>;<c-cedilla>;<eth>;<e-acute>; / 
<e-grave>;<e-circumflex>;<e-diaresis>;<i-acute>;/ 
<i-grave>;<i-circumflex>;<i-diaresis>;<n-tilde>;/ 
<o-acute>;<o-grave>;<o-circumflex>;<o-diaresis>;/ 
<o-tilde>;<o-slash>;<s-sharp>;<thorn>;<u-acute>; / 
<u-grave>;<u-circumflex>;<u-diaresis>;<y-acute>; / 

<y-diaresis> 


space <tab>;<newl ine>;<vertical-tab>;<form-feed>;<carriage-return>; / 
<space> 


cntrl <NUL> 3; <SOH>; <STX>;<ETX>3;<EOT>; <ENQ>;<ACK>;<alert>;<backspace>; / 
<tab>;<newl ine>;<vertical-tab>;<form-feed>;<carriage-return>; / 
<S0>;<SI>;<DLE>;<DC1>;<DC2>;<DC3>;<DC4>;<NAK>;<SYN>;<ETB>; / 

<CAN>; <EM>; <SUB>;<ESC>;<IS4>;<IS3>;<IS2>;<IS1>;<DEL>; / 
<DS>;<SOS>3;<FS>;<WUS>;<BYP>;<NL>;<RNL>;<POC>;<SA>3;<SFE>;<SM>; / 
<CSP>;<MFA>;<SPS>3;<RPT>;<CU1>;<DCS>;<PU1>;<UBS>;<IR>;<PP>;/ 
<TRN>;<NBS>;<GE>;<SBS>;<IT>;<RFF>;<CU3>;<SEL>;<RES>;<PM>;<E0> 


graph <excl amation-mark>;<quotation-mark>;<number-sign>; / 
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<dollar-sign>;<percent-sign>;<ampersand>;<apostrophe>; / 
<left-parenthesis>;<right-parenthesis>;<asterisk>;<plus-sign>;/ 
<comma>;<hyphen-minus>;<period>;<slash>;/ 
<zero>;<one>;<two>;<three>;<four>;<five>;<six>;<seven>; / 
<eight>;<nine>;<colon>;<semicolon>;<less-than-sign>; / 
<equals-sign>;<greater-than-sign>;<question-mark>; / 
<commercial-at>;<A>;<B>;<C>;<D>;<E>;<F>;<G>;<H>;<I>;<J>;<K>;/ 
<L>3;<M>;<N>3;<0>;<P>;<Q>3;<R>;<S>3<T>3<U>;<V>3<W>;<X>3<Y>3<Z>;/ 
<left-square-bracket>;<backslash>; / 
<right-square-bracket>;<circumflex>; / 
<underscore>;<grave-accent>; / 

<a>; <b>;<c>;<d>;<e>;<f>;<g>3<h>;<i>;<j>;<k>3;<1>;<m;/ 
<n>3<0>;<p>;<q>3<r>3<S>3<t>3<u>3<v>3<w>3<x>3;<y>3;<z>;/ 
<left-brace>;<vertical-line>;<right-brace>; / 
<tilde>;<C-cedilla>;<u-diaresis>;<e-acute>;<a-circumflex>;/ 
<a-diaresis>;<a-grave>;<a-ring>;<c-cedilla>;<e-circumflex>;/ 
<e-diaresis>;<e-grave>;<i-diaresis>;<i-circumflex>;/ 
<i-grave>;<A-diaresis>;<A-ring>;<E-acute>;<ae>;<AE>; / 
<o-circumf]ex>;<o-diaresis>;<o-grave>;<u-circumflex>; / 
<u-grave>;<y-diaresis>;<0-diaresis>;<U-diaresis>;<o-slash>;/ 
<sterl ing>;<0-slash>;<multiply>;<a-acute>;<i-acute>;/ 
<o-acute>;<u-acute>;<n-tilde>;<N-tilde>;<feminine>;/ 

<mascul ine>;<question-down>;<registered>;<not>;<one-hal f>;/ 
<one-quarter>;<exclamation-down>;<guillemot-left>;/ 
<guillemot-right>;<A-acute>;<A-circumflex>;<A-grave>; / 
<copyright>;<cent>;<yen>;<a-tilde>;<A-tilde>;<currency>; / 
<eth>;<Eth>;<E-circumflex>;<E-diaresis>;<E-grave>;/ 
<]-acute>;<I-circumflex>;<I-diaresis>;<broken-bar>;/ 
<I-grave>;<0-acute>;<s-sharp>;<0-circumflex>; / 
<0-grave>;<o-tilde>;<0-tilde>;<mu>;<thorn>;<Thorn>;<U-acute>; / 
<U-circumf]ex>;<U-grave>;<y-acute>;<Y-acute>;<macron>; / 
<acute>;<hyphen>;<plus-minus>;<three-quarters>;<paragraph>; / 
<section>;<divide>;<cedilla>;<degree>;<diaresis>;<dot>;/ 
<one-superior>;<three-superior>;<two-superior> 


print <space>;<exclamation-mark>;<quotation-mark>;<number-sign>; / 
<dollar-sign>;<percent-sign>;<ampersand>;<apostrophe>; / 
<left-parenthesis>;<right-parenthesis>;<asterisk>;<plus-sign>;/ 
<comma>;<hyphen-minus>;<period>;<slash>;/ 
<zero>;<one>;<two>;<three>;<four>;<five>;<six>;<seven>; / 
<eight>;<nine>;<colon>;<semicolon>;<less-than-sign>; / 
<equals-sign>;<greater-than-sign>;<question-mark>; / 
<commercial-at>;<A>;<B>;<C>;<D>;<E>;<F>;<G>;<H>;<I>;<J>;<K>;/ 
<L>3;<M>;<N>3;<0>3;<P>;<Q>3;<R>;<S>3<T>3<U>;<V>3<W>;<X>3<Y>3<Z>;/ 
<left-square-bracket>;<backslash>; / 
<right-square-bracket>;<circumflex>; / 
<underscore>;<grave-accent>; / 

<a>; <b>;<c>;<d>;<e>;<f>;<g>3<h>;<i>;<j>;<k>3;<1>;<m>;/ 
<n>3<0>;<p>;<q>3<r>3<S>3<t>3<u>3;<v>3<w>3<x>3;<y>3;<z>;/ 
<left-brace>;<vertical-line>;<right-brace>;/ 
<tilde>;<C-cedilla>;<u-diaresis>;<e-acute>;<a-circumflex>;/ 
<a-diaresis>;<a-grave>;<a-ring>;<c-cedilla>;<e-circumflex>;/ 
<e-diaresis>;<e-grave>;<i-diaresis>;<i-circumflex>;/ 
<i-grave>;<A-diaresis>;<A-ring>;<E-acute>;<ae>;<AE>; / 
<o-circumflex>;<o-diaresis>;<o-grave>;<u-circumflex>; / 
<u-grave>;<y-diaresis>;<0-diaresis>;<U-diaresis>;<o-slash>;/ 
<sterl] ing>;<0-slash>;<multiply>;<a-acute>;<i-acute>;/ 
<o-acute>;<u-acute>;<n-tilde>;<N-tilde>;<feminine>;/ 

<mascul ine>;<question-down>;<registered>;<not>;<one-hal f>;/ 
<one-quarter>;<exclamation-down>;<guillemot-left>;/ 
<guillemot-right>;<A-acute>;<A-circumflex>;<A-grave>; / 
<copyright>;<cent>;<yen>;<a-tilde>;<A-tilde>;<currency>; / 
<eth>;<Eth>;<E-circumflex>;<E-diaresis>;<E-grave>;/ 
<]-acute>;<I-circumflex>;<I-diaresis>;<broken-bar>;/ 
<I-grave>;<0-acute>;<s-sharp>;<0-circumflex>; / 
<0-grave>;<o-tilde>;<0-ti1lde>;<mu>;<thorn>;<Thorn>;<U-acute>; / 
<U-circumf]ex>;<U-grave>;<y-acute>;<Y-acute>;<macron>; / 
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<acute>;<hyphen>;<plus-minus>;<three-quarters>;<paragraph>; / 
<section>;<divide>;<cedilla>;<degree>;<diaresis>;<dot>;/ 


<one-superior>;<three-superior>;<two-superi 


punct <exclamation-mark>;<quotation-mark>;<number-sign>; / 


or> 


<dollar-sign>;<percent-sign>;<ampersand>;<apostrophe>; / 


<left-parenthesis>;<right-parenthesis>;<asterisk>;<plus-sign>;/ 


<comma>;<hyphen-minus>;<period>;<slash>;/ 
<colon>;<semicolon>;<less-than-sign>; / 


<equals-sign>;<greater-than-sign>;<question-mark>; / 


<commercial-at>;/ 
<left-square-bracket>;<backslash>;/ 
<right-square-bracket>;<circumflex>; / 
<underscore>;<grave-accent>; / 


<left-brace>;<vertical-line>;<right-brace>; 


<tilde> 


digit <zero>;<one>;<two>;<three>;<four>; 


<five>;<six>;<seven>;<eight>;<nine> 


xdigit <zero>;<one>;<two>;<three>;<four>; 


<five>;<six>;<seven>;<eight>;<nine>;/ 
<A>;<B>;<C>;<D>;<E>;<F>;/ 
<a>; <b>;<c>;<d>;<e>;<f> 


blank <space>; / 
<tab> 


/ 


/ 


/ 


<D>) ; (<e>,<E>) ;/ 


<f>,<F>) ; (<g>,<G>) ; (<h>,<H>) ; (<i>,<I>) 3 (<j>,<J>);/ 
<k>,<K>) 3 (<1>,<L>) ; (<m>,<M>) ; (<n>,<N>) ; (<o>,<0>) ;/ 
<p>, <P>) ; (<q>,<Q>) ; (<r>,<R>) 3 (<s>,<S>) 3 (<t>,<T>) 3/ 
<u>, <U>) ; (<v>,<V>) 3 (<w>,<W>) ; (<x>,<X>) 3 (<y>,<Y>) 3/ 


<z>,<Z>) ; (<a-acute>,<A-acute>) ; (<a-grave>, 
<a-circumflex>,<A-circumf]ex>) ; (<a-diaresi 


<A-grave>) ;/ 
s>,<A-diaresis>) ;/ 


<a-tilde>,<A-tilde>) ; (<a-ring>,<A-ring>) ; (<ae>,<AE>) ;/ 


<c-cedilla>,<C-cedilla>) ; (<eth>,<Eth>) ; (<e-acute>,<E-acute>) ;/ 


rcumflex>) ;/ 


<e-diaresis>,<E-diaresis>) ; (<i-acute>,<I-acute>) ;/ 


<i-grave>,<I-grave>) ; (<i-circumflex>,<I-ci 


rcumflex>) ;/ 


<i-diaresis>,<I-diaresis>) ;(<n-tilde>,<N-tilde>);/ 


<o-acute>,<0-acute>) ; (<o-grave>,<0-grave>) 
<o-circumflex>,<0-circumflex>) ; (<o-diaresi 


<o-tilde>,<0-tilde>) ; (<o-slash>,<0-slash>) ; (<thorn>,<Thorn>) ;/ 


<u-acute>,<U-acute>) ; (<u-grave>,<U-grave>) 
<u-circumflex>,<U-circumf1ex>) ; (<u-diaresi 
<y-acute>,<Y-acute>) ; (<y-diaresis>,<Y>) 


( 
( 
( 
( 
( 
( 
(<e-grave>,<E-grave>) ; (<e-circumflex>,<E-ci 
( 
( 
( 
( 
( 
( 
( 
( 
( 


tolower (<A>,<a>) ; (<B>,<b>) ; (<C>,<c>) ; (<D>, 


3/ 


s>,<0-diaresis>) ;/ 


;/ 


s>,<U-diaresis>) ;/ 


<d>) 5 (<E>,<e>) ;/ 


<F>,<f>) ; (<G>,<g>) ; (<H>,<h>) ; (<I>,<i>) 3 (<J>,<j>)3/ 
<K>,<k>) 3 (<L>,<1>) 3 (<M>,<m>) ; (<N>,<n>) ; (<0>,<o>) ;/ 
<P>,<p>) ; (<Q>,<q>) ; (<R>,<r>) ; (<S>,<s>) 3 (<T>,<t>) 3/ 
<U>, <u>) 3 (<V>,<v>) 3 (<W>,<w>) ; (<X>,<x>) 3 (<Y>,<y>)3/ 


<Z>,<z>) ; (<A-acute>,<a-acute>) ; (<A-grave>, 
<A-circumflex>,<a-circumf]ex>) ; (<A-diaresi 


<a-grave>) ;/ 
s>,<a-diaresis>) ;/ 


<A-tilde>,<a-tilde>) ; (<A-ring>,<a-ring>) ; (<AE>,<ae>) ;/ 


<C-cedilla>,<c-cedilla>) ; (<Eth>,<eth>) ; (<E-acute>,<e-acute>) ;/ 


<E-grave>,<e-grave>) ; (<E-circumflex>,<e-ci 


<I-grave>,<i-grave>) ; (<I-circumflex>,<i-ci 


rcumflex>) ;/ 


rcumflex>) ;/ 


<I-diaresis>,<i-diaresis>) ; (<N-tilde>,<n-tilde>) ;/ 


<0-acute>,<o-acute>) ; (<0-grave>,<o-grave>) 
<0-circumflex>,<o-circumf1ex>) ; (<0-diaresi 


<0-tilde>,<o-tilde>) ; (<0-slash>,<o-slash>) ; (<Thorn>,<thorn>) ;/ 


<U-acute>,<u-acute>) ; (<U-grave>,<u-grave>) 
<U-circumflex>,<u-circumflex>) ; (<U-diaresi 
<Y-acute>,<y-acute>) 


( 
( 
( 
( 
( 
( 
( 
(<E-diaresis>,<e-diaresis>) ; (<Il-acute>,<i-acute>) ;/ 
( 
( 
( 
( 
( 
( 
( 
( 


3/ 


s>,<o-diaresis>) ;/ 


3/ 


s>,<u-diaresis>) ;/ 
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END LC_CTYPE 


LC_COLLATE 


order_start 


<NUL> 
<SOH> 
<STX> 
<ETX> 
<SEL> 
<tab> 
<RNL> 
<DEL> 
<GE> 

<SPS> 
<RPT> 


<vertical-tab> 
<form-feed> 
<carriage-return> 


<S0> 
<SI> 
<DLE> 
<DC1> 
<DC2> 
<DC3> 
<RES> 
<NL> 


<backspace> 


<POC> 
<CAN> 
<EM> 

<UBS> 
<CU1> 
<IS4> 
<IS$3> 
<IS2> 
<IS1> 
<DS> 

<SOS> 
<FS> 

<WUS> 
<BYP> 


<newl ine> 


<ETB> 
<ESC> 
<SA> 

<SFE> 
<SM> 

<CSP> 
<MFA> 
<ENQ> 
<ACK> 


<alert> 


<SYN> 
<[R> 

<PPp> 

<TRN> 
<NBS> 
<EQT> 
<SBS> 
<li> 

<RFF> 
<CU3> 
<DC4> 
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<NAK> 

<SUB> 

<EQ> 

<space> 
<underscore> 
<macron> 

<hyphen> 
<hyphen-minus> 
<comma> 
<semicolon> 
<colon> 

<excl amation-mark> 
<exclamation-down> 
<question-mark> 
<question-down> 
<slash> 

<period> 

<acute> 
<grave-accent> 
<circumflex> 
<diaresis> 

<tilde> 

<dot> 

<cedilla> 
<apostrophe> 
<quotation-mark> 
<guillemot-left> 
<guillemot-right> 
<left-parenthesis> 
<right-parenthesis> 
<left-square-bracket> 
<right-square-bracket> 
<left-brace> 
<right-brace> 
<section> 
<paragraph> 
<copyright> 
<registered> 
<commercial-at> 
<currency> 

<cent> 
<dollar-sign> 
<sterl] ing> 

<yen> 

<asterisk> 
<backslash> 
<ampersand> 
<number-sign> 
<percent-sign> 
<plus-sign> 
<plus-minus> 
<divide> 
<multiply> 
<less-than-sign> 
<equals-sign> 
<greater-than-sign> 
<not> 
<vertical-1ine> 
<broken-bar> 
<degree> 

<mu> 
<nobreakspace> 
<zero> 
<one-quarter> 
<one-hal f> 
<three-quarters> 
<one> 
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<one-superior> 
<two> 
<two-superior> 
<three> 
<three-superior> 
<four> 

<five> 

<six> 

<seven> 
<eight> 

<nine> 

<a> 

<A> 

<a-acute> 
<A-acute> 
<feminine> 
<a-grave> 
<A-grave> 
<a-circumf]ex> 
<A-circumf]ex> 
<a-ring> 
<A-ring> 
<a-diaresis> 
<A-diaresis> 
<a-tilde> 
<A-tilde> 

<ae> 

<AE> 

<b> 

<B> 

<c> 

<C> 
<c-cedilla> 
<C-cedilla> 
<d> 

<D> 

<eth> 

<Eth> 

<e> 

<E> 

<e-acute> 
<E-acute> 
<e-grave> 
<E-grave> 
<e-circumf]ex> 
<E-circumf]ex> 
<e-diaresis> 
<E-diaresis> 
<f> 

<F> 

<g> 

<G> 

<h> 

<H> 
<i-dotless> 
<i> 

<[> 

<i-acute> 
<I-acute> 
<i-grave> 
<I-grave> 
<i-circumf]ex> 
<I-circumf]ex> 
<i-diaresis> 
<I-diaresis> 
<j > 

<J> 
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<k> 

<K> 

<|> 

<L> 

<m> 

<M> 

<n> 

<N> 
<n-tilde> 
<N-tilde> 
<o> 

<0> 

<mascul ine> 
<o-acute> 
<0-acute> 
<o-grave> 
<0-grave> 
<o-circumf]ex> 
<0-circumflex> 
<o-diaresis> 
<0-diaresis> 
<o-tilde> 
<0-tilde> 
<o-slash> 
<0-slash> 
<p> 

<q> 

<Q> 

<> 

<R> 

<s> 

<S> 
<s-sharp> 
<t> 

<T> 

<thorn> 
<Thorn> 

<u> 

<U> 
<u-acute> 
<U-acute> 
<u-grave> 
<U-grave> 
<u-circumf]ex> 
<U-circumf]ex> 
<u-diaresis> 
<U-diaresis> 
<y> 

<\> 

<w> 

<W> 

<xX> 

<X> 

<y> 

<Y> 
<y-acute> 
<Y-acute> 
<y-diaresis> 
<7> 

</> 


UNDEFINED 


order_end 


END LC_COLLATE 
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LC_MONETARY 


int_curr_symbol '"<U><S><D><space>" 
currency_symbol "<dollar-sign>" 
mon_decimal_point "<period>" 
mon_thousands_ sep "<comma>" 
mon_grouping 3 

positive_sign oo 

negative_sign "<hyphen-minus>" 
int_frac_digits 
frac_digits 
p_cs_precedes 
p_sep_by_space 
n_cs_precedes 
n_sep_by_space 
p_sign_posn 
n_sign_posn 
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END LC_MONETARY 


LC_NUMERIC 


decimal_point "<period>" 
thousands_sep "<comma>" 
grouping 3 


END LC_NUMERIC 


LC_TIME 


abday  "<S><u><n>";/ 
"<M><o><n>"; / 
"<T><u><e>"; / 
"<W><e><d>";/ 
"<T><h><u>";/ 
"<Fo<p><i>"3/ 
"<S><a><t>" 


day "<S><u><n><d><a><y>";3 / 
"<M><o><n><d><a><y>"3 / 
"<T><u><e><s><d><a><y>";3 / 
"<W><e><d><n><e><s><d><a><y>"; / 
"<T><h><u><r><s><d><a><y>"3/ 
"<Fo<p><j><d><a><y>"3 / 
"<S><a><t><u><r><d><a><y>" 


abmon = ='"<J><a><n>";/ 
"<F><e><b>"; / 
"<M><a><r>"; / 
"<A><p><r>"; / 
"<M><a><y>"'3 / 
"<J><u><n>"3/ 
"<J><u><]>"3/ 
"<A><u><g>";/ 
"<S><e><p>";/ 
"<Q><c><t>";/ 
"<N><o><v>"; / 
"<D><e><c>" 


mon "<J><a><n><u><a><r><y>"3 / 
"<F><e><b><r><u><a><r><y>"3/ 
"<M><a><r><c><h>"; / 
"<A><p><r><i><1>"3/ 

"<M><a><y>"; / 

"<J><u><n><e>"; / 
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"<J><u><]><y>"3/ 
"<A><u><g><u><s><t>"; / 
"<S><e><p><t><e><m><b><e><r>"; / 
"<Q><c><t><o><b><e><r>"3/ 
"<N><o><v><e><m><b><e><r>"; / 
"<DP><e><c><e><m><b><e><r>" 
d_t_fmt "%a %b %e %H:%M:%S %Z %Y" 
d_fmt = "%m//%d//%y" 

t_fmt = "%H:%M:%S" 

am_pm NeA><M>" 3 "ePp><M>" 

END LC_TIME 

LC_MESSAGES 


yesexpr "[yY] [eE] [sS] | [yY]" 
noexpr "[nN] [00] | [nN]" 
yesstr "yes:y:Y" 

nostr = "no:n:N" 


END LC_MESSAGES 


LC_TOD 

tzdiff 0 

tname i 
dstname "" 
dststart 0,0,0,0 
dstend 0,0,0,0 
dstshift 0 

END LC_TOD 
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Printed in U.S.A. 


